SUBTILASES

FIELD OF THE INVENTION 

The present invention relates to JP170 and BPN' like subtilases and to methods of constructing
variants of such subtilases with altered properties, such as stability (e.g. thermostability or
storage stability), Ca2+ dependency, pH dependent activity, and improved performance in washing
and cleaning applications.

BACKGROUND OF THE INVENTION 

Enzymes have been used within the detergent industry as part of washing formulations for
more than 30 years. From a commercial perspective, proteases are the most relevant enzymes
in such formulations, but other enzymes including lipases, amylases, cellulases or
mixtures of enzymes are also often used.

To improve the cost and/or the performance of proteases there is an ongoing search for
proteases with altered properties, such as increased activity at low temperatures, increased
thermostability, increased specific activity at a given pH, altered Ca2+ dependency,
increased stability in the presence of other detergent ingredients (e.g. bleach, surfactants
etc.) etc.

The search for proteases with altered properties includes both the discovery of naturally
occurring proteases, i.e. so called wild-type proteases, and the alteration of well-known
proteases by e.g. genetic manipulation of the nucleic acid sequence encoding said proteases.
Knowledge of the relationship between the three-dimensional structure and the function of
a protein has improved the ability to evaluate which areas of a protein to alter to affect a
specific characteristic of the protein.

One family of proteases which is often used in detergents is the subtilases. This family
has previously been further grouped into 6 different sub-groups by Siezen RJ and Leunissen
JAM, 1997, Protein Science, 6, 501-523. One of these sub-groups is the Subtilisin family,
which includes subtilases such as BPN', subtilisin 309 (SAVINASE®, NOVOZYMES A/S),
subtilisin Carlsberg (ALCALASE®, NOVOZYMES A/S), subtilisin S41 (a subtilase from the
psychrophilic Antarctic Bacillus TA41, Davail S et al. 1994, The Journal of Biological
Chemistry, 269(26), pp. 17448-17453), subtilisin S39 (a subtilase from the psychrophilic
Antarctic Bacillus TA39, Narinx E et al. 1997, Protein Engineering, 10 (11), pp. 1271-1279)
and TY145 (a subtilase from Bacillus sp. TY145, NCIMB 40339 described in WO
92/17577).

However, despite the sequence homology between the subtilases belonging to the Subtilisin
subgroup of subtilases, modelling of the three-dimensional structure of one subtilase
on the basis of the three-dimensional structure of another subtilase may result in an
incorrect three-dimensional model structure because of structural differences.

Recently the three-dimensional structure of subtilase TY145 has been elucidated and it
was found that there are several differences between this and the three-dimensional
structure of BPN', which also belongs to the Subtilisin subgroup of subtilases (Danish
application PA 2003 00119).

The differences between the three-dimensional structures of TY145 and BPN' are confirmed
by the three-dimensional structure of the subtilase "sphericase" from Bacillus
sphaericus (PDB NO:1EA7, Protein Data Bank). The overall structure and many details of
this subtilase are highly homologous with the TY145 subtilase structure.

Now the inventors of the present invention disclose the three-dimensional structure of the
subtilase JP170. This subtilase also has great structural differences compared to BPN' and
TY145.

The subtilase JP170 and subtilases similar to JP170 are already known in the art, but the 
three-dimensional structure of the present invention has not been disclosed for such subti- 
lases. 

The JP170 subtilase was first described as protease A in WO 88/01293 to Novo Nordisk.

Later the patent application WO 98/56927 to Novo Nordisk Biotech disclosed the amino
acid (polypeptide) sequence of JP170 and the DNA sequence encoding JP170. The patents
JP7-62152 and JP 4197182 to Lion Corp. disclosed the alkaline, approximately 46 kD protease
Y produced by Bacillus sp. Y, which is homologous to JP170, and the DNA sequence encoding
protease Y. In addition, US 6,376,227 to Kao Corp. discloses physical characteristics
as well as DNA and polypeptide sequences of the alkaline proteases KP43, KP1790 and
KP9860, which are also homologous to JP170. Recently variants of the KP43, KP9860, SD-521
and Y proteases among others were disclosed in EP 1209233. These variants have
the accession numbers aam50090, aam50086, aam50085, aam50084, aam50083,
aam50082, aam50081, aam50080. These proteases are highly homologous, and an
alignment of KP43, KP9860, SD-521, Y and JP170 revealed at least 90% homology.
Therefore JP170, Y (aay44619) and SD-521 (aam50084) represent these proteases in the
alignments of the present application.

In the literature, modelling based on three-dimensional structures of proteins has been
used to transfer advantageous properties from one protein to another. Miyazaki K et al.
2000, J Mol Biol, 297, pp. 1015-1026 discloses enhancement of the thermostability and
activity of the psychrophilic protease subtilisin S41 by methods of directed evolution.
Wintrode PL et al. 2000, Journal of Biological Chemistry, 275 (41), pp. 31635-31640
discloses conversion of a mesophilic subtilisin-like protease from Bacillus sphaericus SSII
into its psychrophilic counterpart by methods of directed evolution. Wintrode et al. constructed
the three-dimensional structural model of the SSII subtilase on the basis of its homology with
subtilisins Carlsberg, Savinase, BPN' and Thermitase, but according to the disclosure of
the three-dimensional structure of the TY145 subtilase, the SSII subtilase pertains to the
new group of TY145 like subtilases.


BRIEF DESCRIPTION OF THE INVENTION 

The inventors have modified the amino acid sequence of a subtilase to obtain variants with
improved properties, based on the three-dimensional structure of the subtilases JP170 and
BPN'. The variants have altered properties, such as increased activity at low temperatures,
increased thermostability, increased specific activity at a given pH, altered Ca2+
dependency, increased stability in the presence of other detergent ingredients (e.g. bleach,
surfactants etc.) etc.

Accordingly, the object of the present invention is to provide a method for constructing
subtilases having altered properties, in particular to provide a method for constructing
subtilases having altered properties as described above.

Thus, in its broadest aspect, the present invention relates to a method for constructing a
variant of a parent subtilase, wherein the variant has at least one altered property as
compared to said parent subtilase, which method comprises:

i) analyzing the three-dimensional structure of the subtilase to identify, on the basis of an
evaluation of structural considerations, at least one amino acid residue or at least one
structural region of the subtilase, which is of relevance for altering said property;

ii) constructing a variant of the subtilase, which as compared to the parent subtilase, has
been modified in the amino acid residue or structural part identified in i) so as to alter said
property; and

iii) testing the resulting subtilase variant for said property.

Although it has been described in the following that modification of the parent subtilase in
certain regions and/or positions is expected to confer a particular effect to the thus
produced subtilase variant, it should be noted that modification of the parent subtilase in any of
such regions may also give rise to any other of the above-mentioned effects. For example,
any of the regions and/or positions mentioned as being of particular interest with respect to,
e.g., improved thermostability, may also give rise to, e.g., higher activity at a lower pH, an
altered pH optimum, or increased specific activity, such as increased peptidase activity.

Further aspects of the present invention relate to variants of a subtilase, the DNA encoding
such variants and methods of preparing the variants. Still further aspects of the present
invention relate to the use of the variants for various industrial purposes, in particular as
an additive in detergent compositions. Other aspects of the present invention will be
apparent from the below description as well as from the appended claims.

BRIEF DESCRIPTION OF FIGURES AND APPENDIX

Figure 1, Alignment of 3D sequences of protease JP170 (mature sequence from Appendix
1), SD-521 (aam50084 from EP 1209233) and protease Y (aay44619 from WO99/67370).
By 3D sequences is meant that the positions of homologous residues are chosen by
superposition of the 3D structures and that the amino acid sequences are subsequently aligned
based on these homologous positions.

Figure 2, Superposition of JP170 and Savinase 3D structures, with indication of calcium 
binding sites. JP170: light structure and three ion-binding sites. Savinase: dark structure 
and two ion-binding sites. 


Figure 3, Matrix of homology between subtilases pertaining to the JP170 and BPN' sub- 
groups. The sequences are identified by sequence database accession numbers. 

APPENDIX 1 shows the structural coordinates for the solved crystal structure of JP170. 


DEFINITIONS 

Prior to discussing this invention in further detail, the following terms and conventions will 
first be defined. 

For a detailed description of the nomenclature of amino acids and nucleic acids, we refer to
WO 00/71691 page 5, hereby incorporated by reference. A description of the nomenclature
of modifications introduced in a polypeptide by genetic manipulation can be found in WO
00/71691 pages 7-12, hereby incorporated by reference.






The term "subtilases" refers to a sub-group of serine proteases according to Siezen et al.,
Protein Engng. 4 (1991) 719-737 and Siezen et al., Protein Science 6 (1997) 501-523. Serine
proteases or serine peptidases are a subgroup of proteases characterised by having a
serine in the active site, which forms a covalent adduct with the substrate. Further the
subtilases (and the serine proteases) are characterised by having two active site amino acid
residues apart from the serine, namely a histidine and an aspartic acid residue.
Subtilases are defined by homology analysis of more than 170 amino acid sequences of
serine proteases previously referred to as subtilisin-like proteases. The subtilases may be
divided into 6 sub-divisions, i.e. the Subtilisin family, the Thermitase family, the Proteinase
K family, the Lantibiotic peptidase family, the Kexin family and the Pyrolysin family.
The Subtilisin family (EC 3.4.21.62) may be further divided into 3 sub-groups, i.e. I-S1
("true" subtilisins), I-S2 (highly alkaline proteases) and intracellular subtilisins. Definitions or
grouping of enzymes may vary or change; however, in the context of the present invention
the above division of subtilases into sub-divisions or sub-groups shall be understood as
those described by Siezen et al., Protein Engng. 4 (1991) 719-737 and Siezen et al.,
Protein Science 6 (1997) 501-523.


The term "parent" is in the context of the present invention to be understood as a protein,
which is modified to create a protein variant. The parent protein may be a naturally
occurring (wild-type) polypeptide or it may be a variant thereof prepared by any suitable means.
For instance, the parent protein may be a variant of a naturally occurring protein which has
been modified by substitution, chemical modification, deletion or truncation of one or more
amino acid residues, or by addition or insertion of one or more amino acid residues to the
amino acid sequence, of a naturally-occurring polypeptide. Thus the term "parent subtilase"
refers to a subtilase which is modified to create a subtilase variant.


The term "variant" is in the context of the present invention to be understood as a protein
which has been modified as compared to a parent protein at one or more amino acid
residues.

The term "modification(s)" or "modified" is in the context of the present invention to be
understood as to include chemical modification of a protein as well as genetic manipulation of
the DNA encoding a protein. The modification(s) may be replacement(s) of the amino acid
side chain(s), substitution(s), deletion(s) and/or insertion(s) in or at the amino acid(s) of
interest. Thus the term "modified protein", e.g. "modified subtilase", is to be understood as a
protein which contains modification(s) compared to a parent protein, e.g. subtilase.

The term "JP170 subtilase" or "JP170 like subtilase" should in the context of the present
invention be understood as a subtilase belonging to the Subtilisin group according to
Siezen et al., Protein Science 6 (1997) 501-523 and which has at least 58% homology to
JP170, SEQ ID NO:1. Thus, among others, the alkaline proteases KP43, KP1790, KP9860,
Y and SD-521 are subtilases belonging to the JP170 subgroup of subtilases. In the context
of the present invention a JP170 subtilase has three ion-binding sites. However, the
number of ion-binding sites may vary in similar structures depending on the medium used for
crystallisation. It appears e.g. that two of the five ion-binding sites of Bacillus sphaericus
"sphericase" (PDB NO:1EA7, Protein Data Bank) were due to a calcium containing
crystallisation medium.

The term "(a) BPN' subtilase" or "(a) BPN' like subtilase" should in the context of the
present invention be understood as a subtilase belonging to the Subtilisin group according
to Siezen et al., Protein Science 6 (1997) 501-523 and which has at least 61%
homology to BPN' SEQ ID NO:5. Such a BPN' like subtilase is for example Savinase. In
the context of the present invention a BPN' subtilase has two ion-binding sites. A BPN' like
subtilase may, in the context of the present invention, belong to branch I-S of the subtilisins,
i.e. to branch I-S1, the "true" subtilisins, or I-S2, the highly alkaline proteases (Siezen et al.,
Protein Engng. 4 (1991) 719-737).


"Homology" or "homologous to" is in the context of the present invention to be understood
in its conventional meaning and the "homology" between two amino acid sequences should
be determined by use of the "Similarity" defined by the GAP program from the University of
Wisconsin Genetics Computer Group (UWGCG) package using default settings for alignment
parameters, comparison matrix, gap and gap extension penalties. The default values for
the GAP penalties are a GAP creation penalty of 3.0 and a GAP extension penalty of 0.1 (Program
Manual for the Wisconsin Package, Version 8, August 1994, Genetics Computer Group,
575 Science Drive, Madison, Wisconsin, USA 53711). The method is also described in S.B.
Needleman and C.D. Wunsch, Journal of Molecular Biology, 48, 443-453 (1970). Identities
can be extracted from the same calculation. The homology between two amino acid
sequences can also be determined by "identity" or "similarity" using the GAP routine of the
UWGCG package version 9.1 with default settings for alignment parameters, comparison
matrix, gap and gap extension penalties. The routine can also be applied using the following
parameters: gap creation penalty = 8 and gap extension penalty = 8, with all other parameters
kept at their default values. Besides the amino acid alignment, the output from the routine is the
calculation of the "Percent Identity" and the "Similarity" between the two sequences. The
numbers calculated using UWGCG package version 9.1 are slightly different from those of
version 8.
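
By way of illustration only, the global alignment method of Needleman and Wunsch referred to
above may be sketched as follows. This is a minimal sketch and not the GAP program: it assumes
a simple match/mismatch score and a single linear gap penalty instead of a comparison matrix
with separate gap creation and gap extension penalties. The function names and the default
scores are hypothetical, and percent identity is here taken as identical residues divided by
the number of gap-free aligned columns, which is likewise an assumption.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Globally align sequences a and b; return the two gapped strings.

    Simplified scoring (assumption): fixed match/mismatch scores and a
    linear gap penalty, not the GAP comparison matrix and affine penalties.
    """
    n, m = len(a), len(b)
    # score[i][j] is the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + s,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            s = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + s:
                out_a.append(a[i - 1])
                out_b.append(b[j - 1])
                i, j = i - 1, j - 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(aligned_a, aligned_b):
    """Identical residues as a percentage of the gap-free aligned columns."""
    pairs = [(x, y) for x, y in zip(aligned_a, aligned_b)
             if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    identical = sum(1 for x, y in pairs if x == y)
    return 100.0 * identical / len(pairs)
```

A call such as percent_identity(*needleman_wunsch(seq_1, seq_2)) would then give a rough
identity figure for two sequences. Values obtained in this way are not expected to agree
with those of the GAP routine, since the scoring scheme differs.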

The term "position" is in the context of the present invention to be understood as the
number of an amino acid in a peptide or polypeptide when counting from the N-terminal end of
said peptide/polypeptide. The position numbers used in the present invention refer to
different subtilases depending on which subgroup the subtilase belongs to.

As mentioned above the alkaline subtilases KP43, KP1790, KP9860, Y, SD-521 and variants
aam50090, aam50086, aam50085, aam50084, aam50083, aam50082, aam50081,
aam50080 of EP 1209233 belong to the JP170 subgroup, based on sequence homology.
Due to the extensive homology only subtilase Y and SD-521 are aligned with JP170. The Y
subtilase and SD-521 subtilase are numbered according to SEQ ID NO:2 and 3
respectively.

Likewise other subtilases belonging to the JP170 subgroup are numbered individually
according to their own sequence. However, in order to determine homologous positions in
such other subtilases an alignment with each of SEQ ID NO:1, 2 and 3 is conducted
according to the GAP procedure described above. Subsequently the homologous positions
are determined with reference to the most homologous of SEQ ID NO:1, 2 and 3.
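
As an illustration of how homologous positions may be read off a gapped pairwise alignment,
the following minimal sketch may be considered. It assumes that the alignment is available
as two strings of equal length in which "-" denotes a gap. The function names and the
dictionary of homology scores are hypothetical and are not part of the GAP procedure.

```python
def map_positions(aligned_query, aligned_ref):
    """Map 1-based query positions to 1-based reference positions.

    Positions are counted from the N-terminal end, skipping gap characters.
    Query residues aligned to a gap in the reference get no entry.
    """
    if len(aligned_query) != len(aligned_ref):
        raise ValueError("aligned sequences must have equal length")
    mapping = {}
    q_pos = 0
    r_pos = 0
    for q_res, r_res in zip(aligned_query, aligned_ref):
        if q_res != "-":
            q_pos += 1
        if r_res != "-":
            r_pos += 1
        if q_res != "-" and r_res != "-":
            mapping[q_pos] = r_pos
    return mapping


def most_homologous(homology_by_reference):
    """Return the name of the reference with the highest homology score."""
    return max(homology_by_reference, key=homology_by_reference.get)
```

For example, most_homologous could be applied to the homology values obtained against each
reference sequence, after which map_positions would be applied to the alignment with the
reference so selected.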

Alternatively subtilases belonging to the JP170 subgroup can be numbered by reference to 
the positions of JP170 subtilase (SEQ ID NO:1). 

Subtilases belonging to the BPN' subgroup are numbered by reference to the positions of
Subtilisin Novo (BPN') from B. amyloliquefaciens.

DETAILED DESCRIPTION OF THE INVENTION 

Despite the great homology of the subtilases described above, the inventors of the present
invention have elucidated the three-dimensional structure of JP170, SEQ ID NO:1, by X-ray
crystallography and found that there are several differences between this and the
three-dimensional structure of BPN'. The inventors of the present invention have further
compared the sequence homology of subtilases belonging to the Subtilisin subgroup. This is
shown in Figure 3 of the present invention.
On the basis of this comparison the inventors of the present invention propose dividing the
Subtilisin subgroup so that the JP170 subtilases become a separate subgroup in addition
to the subgroups of BPN' subtilases and TY145 subtilases (DKPTO, PA 2003 00119).

JP170 subtilases

As described above a JP170 like subtilase is in the context of the present invention to be
understood as a subtilase which has at least 58% homology to SEQ ID NO:1. In particular
said JP170 subtilase may have at least 60% homology to SEQ ID NO:1, such as at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at
least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98% or at least 99% homology to JP170, i.e. to SEQ ID NO:1.

In a first embodiment of the present invention a JP170 subtilase suitable for the purpose
described herein may be a subtilase homologous to the three-dimensional structure of
JP170, i.e. it may be homologous to the three-dimensional structure defined by the
structure coordinates in Appendix 1.

It is well known to a person skilled in the art that a set of structure coordinates for a
protein or a portion thereof is a relative set of points that define a shape in three dimensions.
It is therefore possible that an entirely different set of coordinates could define an identical
or a similar shape. Moreover, slight variations in the individual coordinates may have little
or no effect on the overall shape.

These variations in coordinates may be generated because of mathematical manipulations
of the structure coordinates. For example, the structure coordinates of JP170 (Appendix 1)
may be manipulated by crystallographic permutations of the structure coordinates,
fractionalization of the structure coordinates, integer additions or subtractions to sets of the
structure coordinates, inversion of the structure coordinates or any combination of the above.
Alternatively, said variations may be due to differences in the primary amino acid
sequence.

If such variations are within an acceptable standard error as compared to the structure
coordinates of Appendix 1, said three-dimensional structure is within the context of the present
invention to be understood as being homologous to the structure of Appendix 1. The
standard error may typically be measured as the root mean square deviation of e.g. conserved
backbone residues, where the term "root mean square deviation" (RMS) means the square
root of the arithmetic mean of the squares of the deviations from the mean.
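
For illustration, a root mean square deviation between two sets of coordinates may be
computed as in the following minimal sketch. When structures are compared, the deviations
are commonly taken between equivalent atoms of the two structures, and that convention is
assumed here. The sketch further assumes that the two structures have already been
superimposed and that the atoms are listed pairwise in the same order. The function name is
hypothetical.

```python
import math


def rms_deviation(coords_a, coords_b):
    """Root mean square deviation between paired (x, y, z) coordinates.

    Assumes the structures are already superimposed and that coords_a[k]
    and coords_b[k] are equivalent ("homologous") atoms.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equally long")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        # Squared distance between the two equivalent atoms.
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))
```

The result is expressed in the same unit as the input coordinates.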


It is also well known to a person skilled in the art that, within a group of proteins which
have a homologous structure, there may be variations in the three-dimensional structure in
certain areas or domains of the structure, e.g. loops, which are of no or only minor
importance to the functional domains of the structure, but which may result in a large
root mean square deviation of the conserved residue backbone atoms between said
structures.

Thus it is well known that a set of structure coordinates is unique to the crystallised protein.
No other three-dimensional structure will have the exact same set of coordinates, be it a
homologous structure or even the same protein crystallised in a different manner. There are
natural fluctuations in the coordinates. The overall structure and the inter-atomic
relationship can be found to be similar. The similarity can be discussed in terms of the root mean
square deviation of each atom of a structure from each "homologous" atom of another
structure. However, only identical proteins have the exact same number of atoms. Therefore,
proteins having a similarity below 100% will normally have a different number of atoms,
and thus the root mean square deviation cannot be calculated on all atoms, but only on
the ones that are considered "homologous". A precise description of the similarity based on
the coordinates is thus difficult to formulate and difficult to compute for homologous proteins.
Regarding the present invention, similarities in the 3D structure of different subtilases can be
described by the content of homologous structural elements, and/or the similarity in amino
acid or DNA sequence. For sequences having no deletions or insertions an RMS for the
calcium atoms can be calculated.

Examples of JP170 like subtilases include the alkaline proteases KP43, KP1790, KP9860,
Y, SD-521 and variants aam50090, aam50086, aam50085, aam50084, aam50083,
aam50082, aam50081, aam50080 of EP 1209233. However, to the best of our knowledge
the three-dimensional structure has not been solved for any of these subtilases.

Accordingly, a preferred embodiment of the present invention is a parent subtilase or a
subtilase variant which is at least 58% homologous to the sequence of SEQ ID NO:1,
preferably at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at
least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least
96%, at least 97%, at least 98% or at least 99% homologous to the sequence of SEQ ID
NO:1, and optionally said subtilase further comprises the following structural
characteristics:

a) a twisted beta-sheet with 7 strands,

b) six alpha helices,

c) three ion-binding sites and

not comprising the Strong and Weak ion-binding sites of the BPN' like subtilases, and with
the exception of the subtilases JP170, KP1790, KP9860, KP43, Y, SD-521 and variants
aam50090, aam50086, aam50085, aam50084, aam50083, aam50082, aam50081,
aam50080 of EP 1209233.

The JP170 subtilase of the present invention is encoded by an isolated nucleic acid
sequence, which nucleic acid sequence encodes a subtilase which has at least 58%
homology to SEQ ID NO:1. In particular said nucleic acid sequence encodes a subtilase that has
at least 60% homology to SEQ ID NO:1, such as at least 65%, at least 70%, at least 75%,
at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least
94%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% homology to
SEQ ID NO:1, i.e. to the amino acid sequence of JP170.

Further the isolated nucleic acid sequence encoding a JP170 subtilase of the invention
hybridizes with a complementary strand of a nucleic acid sequence encoding the amino acid
sequence of SEQ ID NO:1 preferably under low stringency conditions, at least under
medium stringency conditions, at least under medium/high stringency conditions, at least
under high stringency conditions, at least under very high stringency conditions.

Suitable experimental conditions for determining hybridization at low, medium, or high
stringency conditions between a nucleotide probe and a homologous DNA or RNA sequence
involve presoaking of the filter containing the DNA fragments or RNA to hybridize
in 5 x SSC (sodium chloride/sodium citrate, Sambrook et al. 1989) for 10 min, and
prehybridization of the filter in a solution of 5 x SSC, 5 x Denhardt's solution (Sambrook et al.
1989), 0.5 % SDS and 100 µg/ml of denatured sonicated salmon sperm DNA (Sambrook et
al. 1989), followed by hybridization in the same solution containing a concentration of
10 ng/ml of a random-primed (Feinberg, A. P. and Vogelstein, B. (1983) Anal. Biochem.
132:6-13), 32P-dCTP-labeled (specific activity > 1 x 10^9 cpm/µg) probe for 12 hours at ca.
45°C. The filter is then washed twice for 30 minutes in 2 x SSC, 0.5 % SDS at at least 55°C
(low stringency), more preferably at least 60°C (medium stringency), still more preferably at
least 65°C (medium/high stringency), even more preferably at least 70°C (high stringency),
and even more preferably at least 75°C (very high stringency).
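
Purely as an illustration, the correspondence between wash temperature and stringency class
given above may be expressed as a simple lookup. The thresholds of 55, 60, 65, 70 and 75
degrees Celsius are those stated in the preceding paragraph; the names used below and the
choice to return None below the lowest threshold are assumptions.

```python
# Minimum wash temperature (degrees Celsius) for each stringency class,
# ordered from the most to the least stringent.
STRINGENCY_LEVELS = [
    (75.0, "very high"),
    (70.0, "high"),
    (65.0, "medium/high"),
    (60.0, "medium"),
    (55.0, "low"),
]


def stringency(wash_temp_c):
    """Return the highest stringency class met by the wash temperature.

    Returns None when the temperature is below the low stringency threshold.
    """
    for threshold, label in STRINGENCY_LEVELS:
        if wash_temp_c >= threshold:
            return label
    return None
```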

BPN' subtilases

As described above a BPN' subtilase is in the context of the present invention to be
understood as a subtilase which has at least 61% homology to SEQ ID NO:4. In particular said
BPN' subtilase may have at least 65%, such as at least 70%, at least 75%, at least 80%, at
least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least
95%, at least 96%, at least 97%, at least 98% or at least 99% homology to BPN', i.e. to
SEQ ID NO:4.

In one embodiment of the present invention a BPN' subtilase suitable for the purpose
described herein may be a subtilase homologous to the three-dimensional structure of BPN'
as defined by the structure coordinates given in PDB Nos. 1SBT and 1GNS (Protein Data
Bank), or one of the several other structures of BPN' that are accessible from the Protein
Data Bank. Variations between homologous structures may occur for several reasons as
described above. Thus a BPN' subtilase within the context of the present invention is to be
understood as any subtilase having the structural characteristics pertaining to the BPN'
subtilases as described above, and in addition such subtilases preferably do not have
further structural characteristics which are not present in the BPN' subtilases as described
herein. Further a BPN' subtilase of the present invention may have the necessary
percentage of similarity with SEQ ID NO:4.

Examples of BPN' like subtilases include the subtilisin 309 (PDB NO:1SVN, SAVINASE®,
NOVOZYMES A/S) and subtilisin Carlsberg (ALCALASE®, NOVOZYMES A/S), among
others.

In figure 1 of R.J. Siezen and J.A.M. Leunissen (Protein Science, Vol 6 (3), pp. 501-523,
1997), page 502, a structure of subtilases is described. A subtilase consists of 6-8 helices and
11 strands, of which 7 are central in a twisted beta-sheet. Two ion-binding sites are
mentioned, the so called "Strong" and "Weak" calcium-binding sites. It was later discovered that
for some structures (subtilisin DY, PDB no. 1BH6, 1998), the Weak calcium-binding site
was a Na (sodium) binding site when the calcium concentration in the crystallization
medium was low. Thus, in the following we refer to ion-binding sites instead of
calcium-binding sites.


The BPN' subtilase of the present invention is encoded by an isolated nucleic acid
sequence, which nucleic acid sequence encodes a subtilase which has at least 61%
homology to SEQ ID NO:4. In particular said BPN' subtilase may have at least 65%, such as at
least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98% or
at least 99% homology to BPN', i.e. to SEQ ID NO:4.

Further the isolated nucleic acid sequence encoding a BPN' subtilase of the invention
hybridizes with a complementary strand of the nucleic acid sequence encoding the amino
acid sequence of SEQ ID NO:4 preferably under low stringency conditions, at least under
medium stringency conditions, at least under medium/high stringency conditions, at least
under high stringency conditions, at least under very high stringency conditions.


Three-dimensional structure of JP170 subtilases 

The JP170 subtilase was used to elucidate the three-dimensional structure forming the ba- 
sis for the present invention. 

The structure of JP170 was solved in accordance with the principles of X-ray
crystallographic methods, for example, as given in X-Ray Structure Determination, Stout, G.H. and
Jensen, L.H., John Wiley & Sons, Inc., NY, 1989.

The structural coordinates for the solved crystal structure of JP170 are given in standard
PDB format (Protein Data Bank, Brookhaven National Laboratory, Brookhaven, CT) as set
forth in Appendix 1. It is to be understood that Appendix 1 forms part of the present
application. In the context of Appendix 1, the following abbreviations are used: CA refers to
c-alpha (carbon atoms) or to calcium ions (however, to avoid misunderstandings we use the
full names "c-alpha atoms" and "calcium" or "ion" in the present specification). Amino acid
residues are given in their standard three-letter code. The attached structural coordinates
contain the protease structure and an inhibitor structure, CI2, as well as water molecules.
The protease coordinates have a chain identification called A, whereas the CI2 inhibitor is
called B, the calcium ions are called C, and the water is W. In the following the positions of
the mentioned residues refer to the sequence of JP170 as disclosed in SEQ ID NO:1.
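
As an illustration of how coordinate records in standard PDB format may be read, a minimal
sketch is given below. It relies on the fixed column layout of ATOM and HETATM records in
the PDB format. The way c-alpha atoms are told apart from calcium ions (ATOM records versus
HETATM records carrying the atom name CA) is an assumption that has not been checked against
Appendix 1, and all function names are hypothetical. The helper that builds a record is
included only so that the example is self-contained.

```python
def parse_coordinate_line(line):
    """Parse one ATOM/HETATM record using the fixed PDB column layout.

    Returns None for any other record type.
    """
    record = line[0:6].strip()
    if record not in ("ATOM", "HETATM"):
        return None
    return {
        "record": record,
        "atom_name": line[12:16].strip(),   # columns 13-16
        "res_name": line[17:20].strip(),    # columns 18-20
        "chain": line[21],                  # column 22
        "res_seq": int(line[22:26]),        # columns 23-26
        "xyz": (float(line[30:38]),         # columns 31-38
                float(line[38:46]),         # columns 39-46
                float(line[46:54])),        # columns 47-54
    }


def is_c_alpha(atom):
    # Assumption: c-alpha atoms are ATOM records with the atom name CA.
    return atom["record"] == "ATOM" and atom["atom_name"] == "CA"


def is_calcium(atom):
    # Assumption: calcium ions are HETATM records with the atom name CA.
    return atom["record"] == "HETATM" and atom["atom_name"] == "CA"


def format_coordinate_line(record, serial, atom_name, res_name, chain,
                           res_seq, x, y, z):
    """Build a minimal fixed-column record (illustration and testing only)."""
    return (f"{record:<6}{serial:>5} {atom_name:<4} {res_name:>3} {chain}"
            f"{res_seq:>4}    {x:8.3f}{y:8.3f}{z:8.3f}")
```

Records parsed in this way can then be grouped by their chain identification, e.g. A for the
protease and B for the inhibitor as described above.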

The JP170 structure consists of two domains, a catalytic domain and a C-terminal domain.
The structure of the catalytic domain shows the same overall fold as found in the S8 family
of subtilisins. The structure comprises a twisted beta-sheet with 7 strands arranged in the
following sequential order: S2, S3, S1, S4, S5, S6, S7.
There are six alpha helices in the catalytic domain structure, of which H1 contains
residues 9-17, H2 contains residues 68-76, H3 contains residues 110-119, H4 contains
residues 139-150, H5 contains residues 253-273 and H6 contains residues 281-291.
The C-terminal domain comprises a strand motif, a so called "beta sandwich" consisting of
sheets a and b. The sheets in this domain are composed of strands arranged in an anti-parallel
fashion, whereas the strands in the catalytic domain are arranged in parallel. The sequential
order of the strands can be denoted as: S1a-S1b-S3a-S3b-S4b-S4a-S2b-S2a, with the beta
sandwich organised as the two sheets S1a, S3a, S4a, S2a and S1b, S3b, S4b, S2b.

The JP170 subtilases are shown to lack the well-known Strong and Weak ion-binding sites
of the BPN' subtilases. However, the JP170 subtilases have three ion-binding sites which
are not present in the BPN' subtilisin structures. This can be seen in the structural
alignment presented in Figure 2. These three ion-binding sites are hereinafter referred to as
Site 1, which is placed in the catalytic domain, and Sites 2 and 3, which are placed in the
non-catalytic C-terminal domain.

Thus, in relation to the atomic coordinates disclosed in Appendix 1, the ion-binding sites of
JP170 are located at:
Site 1 - calcium atom named A601 CA,
Site 2 - calcium atom named A603 CA, and
Site 3 - calcium atom named A602 CA in the PDB table (Appendix 1).

The position of an ion-binding site can be defined by the distance to four specific atoms in
the core structure. The distances from the ion-binding site to the c-alpha atoms of the three
active site residues have been chosen. Throughout the subtilases the residues Ser, His and
Asp in the active site are highly conserved. In JP170 they are Asp30, His68 and Ser254.
The fourth distance chosen is the distance to the c-alpha atom of the amino acid residue
coming first after the active site serine residue in the sequence (hereinafter called "next to
Ser"); in the 3D structure of JP170 it is Met255.
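As an illustration only, the four-distance description of a site can be computed from any PDB-format coordinate file. The sketch below is not the procedure used to obtain the values in this specification; the helper names and the coordinates are hypothetical, and the records are generated in the code so that the fixed column layout of the PDB format is explicit.

```python
import math

def parse_atom(line):
    """Parse the fixed-column fields of one PDB ATOM/HETATM record."""
    return {
        "record": line[0:6].strip(),
        "name": line[12:16].strip(),
        "resname": line[17:20].strip(),
        "chain": line[21],
        "resseq": int(line[22:26]),
        "xyz": (float(line[30:38]), float(line[38:46]), float(line[46:54])),
    }

def distance(atom_a, atom_b):
    """Euclidean distance (in Å) between two parsed atoms."""
    return math.dist(atom_a["xyz"], atom_b["xyz"])

def make_record(record, serial, name, resname, chain, resseq, x, y, z):
    """Build a fixed-column record so the example does not depend on a real file."""
    return (f"{record:<6}{serial:>5} {name:<4} {resname:>3} {chain}{resseq:>4}"
            f"    {x:8.3f}{y:8.3f}{z:8.3f}")

# Hypothetical coordinates, NOT taken from Appendix 1.
ser_ca = parse_atom(make_record("ATOM", 1, "CA", "SER", "A", 254, 0.0, 0.0, 0.0))
calcium = parse_atom(make_record("HETATM", 2, "CA", "CA", "A", 601, 3.0, 4.0, 12.0))

print(round(distance(ser_ca, calcium), 2))
```

Because both a c-alpha atom and a calcium ion carry the atom name CA, the sketch keeps the record type and residue name alongside the atom name so that the two can be told apart.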

In a preferred embodiment of the present invention, the distance between:

a) ion-binding site 1 and i) Asp c-alpha atom is 26.70-28.70 Å, ii) His c-alpha atom is
22.10-24.10 Å, iii) Ser c-alpha atom is 16.95-18.95 Å, iv) next to Ser c-alpha atom is
15.30-17.30 Å,

b) ion-binding site 2 and i) Asp c-alpha atom is 33.50-35.50 Å, ii) His c-alpha atom is
37-39 Å, iii) Ser c-alpha atom is 29.40-31.40 Å, iv) next to Ser c-alpha atom is
30.70-32.70 Å,

c) ion-binding site 3 and i) Asp c-alpha atom is 41.50-43.50 Å, ii) His c-alpha atom is
42.90-44.90 Å, iii) Ser c-alpha atom is 34.50-36.50 Å, iv) next to Ser c-alpha atom is
35-37 Å.

Below are the specific distances between the four chosen c-alpha atoms and the three
ion-binding sites of the JP170 subtilase, given in Å:

            site 1    site 2    site 3
Met255      16.34     31.68     36.02
His68       23.12     38.03     43.87
Asp30       27.69     34.49     42.48
Ser254      17.95     30.41     35.51
site 1       0        35.29     32.92
site 2      35.29      0        14.08
site 3      32.92     14.08      0

However, these distances may vary from one subtilase to another. The present distances
are given with a calcium ion in the structure. If a sodium ion were bound instead, the
distances would shift slightly. Generally the distances can vary ±0.80 Å, preferably
±0.70 Å, ±0.60 Å, ±0.50 Å, ±0.40 Å, or most preferably ±0.30 Å.

Further, in the JP170 like subtilases, the peptide structure surrounding ion-binding site 1
up to a distance of 10 Å is composed of the amino acid residues placed in positions 183-189,
191-204 and 224-225.

The peptide structure surrounding ion-binding site 2 up to a distance of 10 Å is composed
of residues 378-393.

The peptide structure surrounding ion-binding site 3 up to a distance of 10 Å is composed
of residues 348, 350, 352, 363-370, 380-383, 391-400 and 414-420.
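Position ranges such as these recur throughout the specification. The following sketch, with hypothetical helper names, expands a range string into individual positions and compresses a set of positions back into ranges; it is used here to list the positions that lie within 10 Å of both Site 2 and Site 3.

```python
def expand(spec):
    """'183-189, 191-204, 224-225' -> list of individual positions."""
    positions = []
    for part in spec.split(","):
        part = part.strip()
        if "-" in part:
            low, high = part.split("-")
            positions.extend(range(int(low), int(high) + 1))
        else:
            positions.append(int(part))
    return positions

def compress(positions):
    """Inverse of expand: merge consecutive positions into 'a-b' ranges."""
    runs = []
    for pos in sorted(set(positions)):
        if runs and pos == runs[-1][1] + 1:
            runs[-1][1] = pos
        else:
            runs.append([pos, pos])
    return ", ".join(str(a) if a == b else f"{a}-{b}" for a, b in runs)

SITE_1 = "183-189, 191-204, 224-225"
SITE_2 = "378-393"
SITE_3 = "348, 350, 352, 363-370, 380-383, 391-400, 414-420"

shared = set(expand(SITE_2)) & set(expand(SITE_3))
print(compress(shared))
```

Positions shared between two sites are candidates whose modification could influence both sites at once, which may or may not be desirable.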

In comparison with the BPN' like subtilase structures, the structure of the JP170 like
subtilases can be divided into a "core subtilase-like" region, an "intermediate" region and a
"nonhomologous" region.

The active site can be found in the core subtilase-like region, which is structurally closely
related to the BPN' structures. The core subtilase-like region is composed of residues 17-34,
197-209 and 216-232, and contains the alpha-helix H3 and the central alpha-helix H5 in
which the active site serine residue is situated in the N-terminal part. The core
subtilase-like region has an RMS lower than 1.2.

Outside the core subtilase-like region the structure of the JP170 like subtilase differs from
the BPN' structures to a greater extent.

The intermediate region consists of residues 42-46, 150-186, 245-272 and 278-296. The
intermediate region has an RMS bigger than 1.2 and less than 1.8. The relationships between
the three-dimensional structure and functionality are potentially difficult to predict in
this region of the JP170 like subtilases.

The nonhomologous region consists of residues 1-16, 35-41, 47-149, 187-196, 210-215,
233-244, 273-277 and 297-316. The nonhomologous region has an RMS higher than 1.8.
The relationships between the three-dimensional structure and functionality are very
difficult to predict in this region of the JP170 like subtilases.
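The three region definitions can be written down as data and checked for consistency. The sketch below uses the residue ranges and RMS thresholds stated above; the function names are hypothetical, and the handling of RMS values exactly equal to 1.2 or 1.8 is an assumption, since the text leaves those boundaries open.

```python
REGIONS = {
    "core": [(17, 34), (197, 209), (216, 232)],
    "intermediate": [(42, 46), (150, 186), (245, 272), (278, 296)],
    "nonhomologous": [(1, 16), (35, 41), (47, 149), (187, 196), (210, 215),
                      (233, 244), (273, 277), (297, 316)],
}

def regions_of(position):
    """Names of all regions whose ranges contain `position` (ideally exactly one)."""
    return [name for name, spans in REGIONS.items()
            if any(low <= position <= high for low, high in spans)]

def classify_rms(rms):
    """Assign a region from an RMS value; boundary values are assigned by assumption."""
    if rms < 1.2:
        return "core"
    if rms < 1.8:
        return "intermediate"
    return "nonhomologous"

# Every position 1-316 should fall in exactly one region.
unassigned = [p for p in range(1, 317) if len(regions_of(p)) != 1]
print(unassigned)
print(regions_of(30), regions_of(240), classify_rms(1.5))
```

With the ranges as given, the three regions together cover positions 1-316 without gaps or overlaps.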

15 

Many loops in the 3D structure of the JP170 like subtilases differ significantly from the BPN'
type structures, both in length and in content of amino acid residues. The following loops or
protein sequence stretches of JP170 are compared to Savinase (in parentheses):

G32-H43 (G34-H39)
E44-Y54 (P40-A48)
G57-G67 (V51-G63)
N79-N82 (I75-V81)
I96-P107 (V95-S105)
A108-S119 (106-N117)
A131-Y137 (S128-S132)
T138-D152 (A133-G146)
E162-I169 (S156-I165)
G173-T180 (A169-A176)
E185-N199 (D181-N184)
G208-D218 (G193-D197)
S232-K246 (G211-T213)
D294-N303 (S256-L262)

The loops N79-N82 (I75-V81) and G208-D218 (G193-D197) are in contact with an ion-binding
site in Savinase, but not in JP170. Similarly, the loop E185-N199 (D181-N184) is in
contact with an ion-binding site in JP170, but not in Savinase. This knowledge opens up
possibilities for adding ion-binding sites to, or removing them from, subtilases of the
JP170 and BPN' like types.

A good example of the difference is the loop S232-K246, which has 15 residues compared
to the corresponding BPN' type loop G211-T213 (in Savinase), which has only three
residues. In the JP170 like subtilases, the loop folds back to the substrate binding site,
especially the P' parts of the substrate binding site. The loop is situated close to the
substrate, as illustrated by the CI2 inhibitor bound in the 3D structure attached (Appendix 1).

The location of loop S232-K246 can be described in relation to the four specific residues
described above. The distances from the CA atom of residue W240 in the loop to the CA
atoms of the active site residues are:

Residue        H68      D30      S254     M255
Distance, Å    11.45    18.51    13.06    11.94

As mentioned above, distances like these can vary ±0.80 Å, preferably ±0.70 Å, ±0.60 Å,
±0.50 Å, ±0.40 Å, or most preferably ±0.30 Å.

Furthermore, distances from the residues of JP170 loop S232-K246 to atoms of the CI2
inhibitor can be calculated. These distances are:
from CA atom of W240 to CA atom of R62 in CI2 is 7.49 Å,
from CA atom of F239 to CA atom of R62 in CI2 is 8.39 Å,
from CA atom of S238 to CA atom of R62 in CI2 is 8.42 Å,
from CA atom of S237 to CA atom of R62 in CI2 is 9.44 Å,
from CA atom of S238 to CA atom of E60 in CI2 is 9.42 Å.

The distances from the JP170 active site residue S254 to atoms of the CI2 inhibitor, as placed
in the 3D coordinates of Appendix 1, are:
from CA atom of S254 to CA atom of E60 in CI2 is 5.25 Å,
from CA atom of S254 to CA atom of R62 in CI2 is 11.55 Å,
from CA atom of S254 to CA atom of T58 in CI2 is 7.06 Å,
from CA atom of S254 to CA atom of M59 in CI2 is 4.71 Å.

The distances can vary ±0.80 Å, preferably ±0.70 Å, ±0.60 Å, ±0.50 Å, ±0.40 Å, or most
preferably ±0.30 Å.

A preferred JP170 like subtilase variant has a deletion in the region S232-K246, and the
subsequent insertion of one or more residues to partly or completely remove the loop.
Preferred variants comprise the deletion of L233-S245 + insertion of Asn, deletion of
L233-D244 + insertion of Gly or deletion of S232-D244 + insertion of Gly.
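A deletion followed by an insertion, as in the loop-removal variants just described, can be expressed as a simple operation on a sequence string. This is a sketch under stated assumptions: the function name is hypothetical, positions are 1-based and inclusive, and the sequences used are placeholders rather than SEQ ID NO:1.

```python
def delete_and_insert(sequence, first, last, insert=""):
    """Delete residues first..last (1-based, inclusive) and put `insert` in their place."""
    if not 1 <= first <= last <= len(sequence):
        raise ValueError("deletion range lies outside the sequence")
    return sequence[:first - 1] + insert + sequence[last:]

# Toy example: delete positions 3-6 and insert Asn (N).
print(delete_and_insert("ABCDEFGHIJ", 3, 6, "N"))

# Length bookkeeping for "deletion of L233-S245 + insertion of Asn" on a
# placeholder sequence of 433 residues (the content is NOT the real sequence).
placeholder = "X" * 433
variant = delete_and_insert(placeholder, 233, 245, "N")
print(len(placeholder) - len(variant))
```

Deleting L233-S245 removes 13 residues and the inserted Asn adds one, so the variant is 12 residues shorter than the parent.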

Homology building of JP170 and BPN' subtilases

A model structure of a JP170 like subtilase or a BPN' like subtilase can be built using the
Homology program or a comparable program, e.g., Modeller (both from Molecular
Simulations, Inc., San Diego, CA). The principle is to align the amino acid sequence of a
protein for which the 3D structure is known with the amino acid sequence of a protein for
which a model 3D structure has to be constructed. The structurally conserved regions can
then be built on the basis of consensus sequences. In areas lacking homology, loop structures
can be inserted, or sequences can be deleted with subsequent bonding of the necessary
residues using, e.g., the program Homology. Subsequent relaxing and optimization of the
structure should be done using either Homology or another molecular simulation program,
e.g., CHARMm from Molecular Simulations.



Methods for designing JP170 and BPN' subtilase variants

Comparisons of the molecular dynamics of different proteins can give a hint as to which
domains are important for, or connected to, certain properties possessed by each protein.

The present invention comprises a method of producing a variant of a parent JP170 like
subtilase, the variant having at least one altered property as compared to the parent JP170
like subtilase, the method comprising:

a) modelling the parent JP170 subtilase on the three-dimensional structure of a JP170
subtilase to produce a three-dimensional structure of the parent JP170 subtilase;

b) identifying on the basis of the comparison in step a) at least one structural part of the
parent JP170 subtilase, wherein an alteration in said structural part is predicted to
result in an altered property;

c) modifying the nucleic acid sequence encoding the parent JP170 subtilase to produce a
nucleic acid sequence encoding deletion or substitution of one or more amino acids at
a position corresponding to said structural part, or an insertion of one or more amino
acid residues in positions corresponding to said structural part;

d) expressing the modified nucleic acid sequence in a host cell to produce the variant
JP170 subtilase;

e) isolating the produced subtilase;

f) purifying the isolated subtilase and 

g) recovering the purified subtilase. 

Further, the present invention comprises a method of producing a variant of a parent
Subtilisin family subtilase, such as a BPN' like subtilase, the variant having at least one
altered property as compared to the parent Subtilisin family subtilase, the method comprising:

a) modelling the parent Subtilisin family subtilase on the three-dimensional structure of a
Subtilisin family subtilase to produce a three-dimensional structure of the parent
Subtilisin family subtilase;

b) comparing the three-dimensional structure obtained in step a) to the three-dimensional
structure of a JP170 like subtilase;

c) identifying on the basis of the comparison in step b) at least one structural part of the
parent Subtilisin family subtilase, wherein an alteration in said structural part is
predicted to result in an altered property;

d) modifying the nucleic acid sequence encoding the parent Subtilisin family subtilase to
produce a nucleic acid sequence encoding deletion or substitution of one or more
amino acids at a position corresponding to said structural part, or an insertion of one or
more amino acid residues in positions corresponding to said structural part;

e) expressing the modified nucleic acid sequence in a host cell to produce the variant
Subtilisin family subtilase;

f) isolating the produced subtilase;

g) purifying the isolated subtilase and

h) recovering the purified subtilase.

Further, the present invention comprises a method of producing a variant of a parent JP170
like subtilase, the variant having at least one altered property as compared to the parent
JP170 like subtilase, the method comprising:

a) modelling the parent JP170 like subtilase on the three-dimensional structure of a
JP170 like subtilase to produce a three-dimensional structure of the parent JP170 like
subtilase;

b) comparing the three-dimensional structure obtained in step a) to the three-dimensional
structure of a Subtilisin family subtilase;

c) identifying on the basis of the comparison in step b) at least one structural part of the
parent JP170 like subtilase, wherein an alteration in said structural part is predicted to
result in an altered property;

d) modifying the nucleic acid sequence encoding the parent JP170 like subtilase to
produce a nucleic acid sequence encoding deletion or substitution of one or more amino
acids at a position corresponding to said structural part, or an insertion of one or more
amino acid residues in positions corresponding to said structural part;

e) expressing the modified nucleic acid sequence in a host cell to produce the variant
JP170 like subtilase;

f) isolating the produced subtilase;

g) purifying the isolated subtilase and

h) recovering the purified subtilase.



Stability - alteration of ion-binding sites

As described above, the JP170 subtilases have three new ion-binding sites not present in the
BPN' subtilisin structures but lack the Strong and Weak ion-binding sites of the BPN'
subtilases. Stability of the ion-binding site is of crucial importance for the functionality of
the enzyme. Therefore alterations of the ion-binding sites are likely to result in alterations
of the stability of the enzyme.

Improved stability

Stabilisation of a JP170 subtilase may possibly be obtained by alterations in the positions
close to the ion-binding sites. Thus a preferred variant of the present invention has a
modification in one or more of the positions located within a distance of 10 Å of the
ion-binding sites of JP170 (SEQ ID NO:1). The positions are:

Site 1: 183-189 (i.e. positions 183, 184, 185, 186, 187, 188, 189),
        191-204 (i.e. positions 191, 192, 193, 194, 195, 196, 197, 198, 199, 200,
        201, 202, 203, 204),
        224-225;

Site 2: 378-393 (i.e. positions 378, 379, 380, 381, 382, 383, 384, 385, 386, 387,
        388, 389, 390, 391, 392, 393);

Site 3: 348, 350, 352,
        363-370 (i.e. positions 363, 364, 365, 366, 367, 368, 369, 370),
        380-383 (i.e. positions 380, 381, 382, 383),
        391-400 (i.e. positions 391, 392, 393, 394, 395, 396, 397, 398, 399, 400),
        414-420 (i.e. positions 414, 415, 416, 417, 418, 419, 420).

In detergent compositions, calcium chelators contribute to the removal of calcium from the
subtilases, with subsequent inactivation of the enzyme as the result. To decrease the
inactivation due to calcium removal by, e.g., calcium chelators, variants with improved
calcium stability were constructed.

Preferred variants stabilised in ion-binding site 1 are S193Q,Y; H200D,N and
H200D,N+D196N.

Preferred variants stabilised in ion-binding site 2 are N390D and N391D, and preferred
variants stabilised in ion-binding site 3 are G394N,Q,F,Y,S and W392S,N,Q.
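The compact notation used for these variants lists the alternative replacement residues after a single position. As a sketch of how such shorthand might be expanded into individual substitutions, assuming the comma-separated letters are alternatives at the same position (the function name is hypothetical):

```python
import re

def expand_variant(token):
    """Expand e.g. 'S193Q,Y' into ['S193Q', 'S193Y']."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z](?:,[A-Z])*)", token)
    if match is None:
        raise ValueError(f"not a substitution token: {token!r}")
    wild_type, position, alternatives = match.groups()
    return [f"{wild_type}{position}{alt}" for alt in alternatives.split(",")]

for token in ("S193Q,Y", "H200D,N", "G394N,Q,F,Y,S", "N390D"):
    print(token, "->", expand_variant(token))
```

Combined variants written with "+" would first have to be split on that character, and each part expanded separately.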

Alteration of thermostability 

A variant with improved stability (typically increased thermostability) may be obtained by 
substitution with proline, introduction of a disulfide bond, altering a hydrogen bond contact, 
15 altering charge distribution, introduction of a salt bridge, filling in an internal structural cavity 
with one or more amino acids with bulkier side groups (in e.g. regions which are structurally 
mobile), substitution of histidine residues with other amino acids, removal of deamidation
sites, or by helix capping.

Regions with increased mobility:

The following regions of JP170 have an increased mobility in the crystal structure of the
enzyme, and it is presently believed that these regions can be responsible for stability or
activity of JP170. Especially thermostabilisation may possibly be obtained by altering the
highly mobile regions. Improvements of the enzyme can be obtained by mutation in the
regions and positions below. Introducing, e.g., larger residues or residues having more atoms
in the side chain could increase the stability, or, e.g., introduction of residues having fewer
atoms in the side chain could be important for the mobility and thus the activity profile of the
enzyme.

Two methods are used to extract the highly mobile regions from a 3D structure. One is a
molecular dynamics calculation of the isotropic fluctuations, and the other is an analysis of
the B-factors. The B-factors are listed in the PDB file and give a value to the uncertainty of
determination of the location of the various atoms of the structure. The uncertainty relates to
the mobility of the atoms in the molecules in the crystal lattice. This mobility reflects the
thermal motion of the atoms and thus indicates possible sites for thermostabilisation of the
enzyme.

Thus, by analysing the B-factors taken from the coordinate file in Appendix 1 (see "X-Ray
Structure Determination", Stout, G.K. and Jensen, L.H., John Wiley & Sons, Inc., NY,
1989), the following mobile regions in the JP170 structure were revealed:

13-18 (i.e. positions 13, 14, 15, 16, 17, 18),
37-43 (i.e. positions 37, 38, 39, 40, 41, 42, 43),
47-50 (i.e. positions 47, 48, 49, 50),
57-59 (i.e. positions 57, 58, 59),
96-103 (i.e. positions 96, 97, 98, 99, 100, 101, 102, 103),
131-134 (i.e. positions 131, 132, 133, 134),
152-153,
162-166 (i.e. positions 162, 163, 164, 165, 166),
188-195 (i.e. positions 188, 189, 190, 191, 192, 193, 194, 195),
210,
234-246 (i.e. positions 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245,
246),
372-378 (i.e. positions 372, 373, 374, 375, 376, 377, 378),
387-392 (i.e. positions 387, 388, 389, 390, 391, 392),
406-407,
419.


Molecular dynamics simulation at 300K and 400K of JP170 reveals the following highly
mobile regions:

37-42 (i.e. positions 37, 38, 39, 40, 41, 42),
57-60 (i.e. positions 57, 58, 59, 60),
66-67,
98-103 (i.e. positions 98, 99, 100, 101, 102, 103),
107-111 (i.e. positions 107, 108, 109, 110, 111),
188-193 (i.e. positions 188, 189, 190, 191, 192, 193),
236-240 (i.e. positions 236, 237, 238, 239, 240),
326-332 (i.e. positions 326, 327, 328, 329, 330, 331, 332),
337-342 (i.e. positions 337, 338, 339, 340, 341, 342),
355-360 (i.e. positions 355, 356, 357, 358, 359, 360),
372-377 (i.e. positions 372, 373, 374, 375, 376, 377),
384-388 (i.e. positions 384, 385, 386, 387, 388),
404-411 (i.e. positions 404, 405, 406, 407, 408, 409, 410, 411).

Thus, a preferred JP170 subtilase variant of the present invention has been modified in one
or more of the above mentioned positions of SEQ ID NO:1. Further preferred variants
comprise one or more alterations in the regions 57-60, 66-67, 107-111, 236-240, 326-332,
355-360, 372-377, 384-388, 404-411. Especially preferred is variant W240H,Y and variants
modified in the region 355-360, such as variants comprising one or more of the
modifications: G355A,S; S356T,N; T357N,Q,D,E,P; T358S; A359S,T,N,Q and S360T,N.



Variants modified in the region 355-360 may be produced in accordance with the method
for random mutagenesis by use of the DOPE program as described herein. To obtain
variants comprising 1-3 modifications in region 355-360 one may introduce the substitutions
with the following frequencies:

wild-type    modified
95%          5%     G355A,S
90%          10%    S356T,N
80%          20%    T357N,Q,D,E,P
90%          10%    T358S
80%          20%    A359S,T,N,Q
80%          20%    S360T,N.
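To see what these doping frequencies imply for the number of modifications per variant, the distribution can be computed directly. The sketch below assumes that the six positions are mutated independently of each other, which is an assumption made for illustration and not a statement about the DOPE program; the names are hypothetical.

```python
# Per-position probability of a modified residue, from the table above.
MODIFIED_FREQUENCY = {355: 0.05, 356: 0.10, 357: 0.20, 358: 0.10, 359: 0.20, 360: 0.20}

def mutation_count_distribution(frequencies):
    """Return [P(0 modifications), P(1), ..., P(n)] for independent positions."""
    distribution = [1.0]
    for p in frequencies:
        updated = [0.0] * (len(distribution) + 1)
        for count, probability in enumerate(distribution):
            updated[count] += probability * (1 - p)      # position stays wild-type
            updated[count + 1] += probability * p        # position is modified
        distribution = updated
    return distribution

dist = mutation_count_distribution(MODIFIED_FREQUENCY.values())
for count, probability in enumerate(dist):
    print(count, round(probability, 4))
print("P(1-3 modifications):", round(sum(dist[1:4]), 4))
```

Under the independence assumption roughly 39% of the clones would carry no modification in the region, and the expected number of modifications per clone is 0.85.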

Disulfide bonds: 

A JP170 variant of the present invention with improved stability, e.g. thermostability, as
compared to the parent JP170 subtilase may be obtained by introducing new inter-domain
or intra-domain bonds, such as by establishing inter- or intra-domain disulfide bridges.
Thus a further aspect of the present invention relates to a method for producing a variant of
a parent JP170 comprising the methods described in the paragraph "Methods of preparing
JP170 like or BPN' like subtilase variants" herein.

According to the guidelines mentioned above, the below mentioned amino acid residues
identified in the amino acid sequence of SEQ ID NO:1 are considered as being suitable for
cysteine replacement. With one or more of these substitutions with cysteine, disulfide
bridges may possibly form in a variant of JP170. The substitutions are: G21C/A86C,
V26C/A265C, G57C/G105C, G74C/A229C, Q111C/Y143C, G160C/S170C, A286CA/349C,
A27C/A122C, A45C/G78C, V72C/P258C, G78C/A229C, D98C/G104C, Q111CA'147C,
G135C/G167C, R142C/P354C, V144C/A178C, G182C/P217C, A183C/G223C,
A195C/Y225C, F271C/P279C, A287C/A430C, A293C/S310C, E322C/S428C,
S324C/A332C, S327C/P424C, D352C/N397C, G255C/T362C, G291C/S314C.
Preferred variants comprise one or more of the substitutions: G21C/A86C, V26C/A265C,
G57C/G105C, G74C/A229C, Q111C/Y143C, G160C/S170C, A286CA/349C, A4C/P222C
and A27CA/117C.

Similar residues suitable for cysteine replacement in subtilases homologous with JP170
can be elucidated by finding the homologous positions in the alignment of Figure 1.
Concerning another JP170 like sequence, the homologous positions suitable for cysteine
replacement can be selected by aligning said JP170 like sequence with all of the sequences
of Figure 1 using the GAP analysis method as described above. The suitable residues can
then be selected in accordance with the homologous positions in the most homologous of
SEQ ID NOs: 1, 2 and 3, which are the sequences of the subtilases aligned in Figure 1.

Surface charge distribution 

A variant with improved stability (typically improved thermostability) as compared to the
parent subtilase may be obtained by changing the surface charge distribution of the
subtilase. For example, when the pH is lowered to about 5 or below, histidine residues
typically become positively charged and, consequently, unfavorable electrostatic interactions
on the protein surface may occur. By engineering the surface charge of the subtilase one may
avoid such unfavorable electrostatic interactions, which in turn leads to a higher stability of
the subtilase.

Therefore, a further aspect of the present invention relates to a method for constructing a
variant of a parent subtilase, the method comprising:

a) identifying, on the surface of the parent subtilase, preferably a JP170 like or a BPN' like
subtilase, at least one amino acid residue selected from the group consisting of Asp,
Glu, Arg, Lys and His;

b) substituting, on the surface of the parent subtilase, at least one amino acid residue
selected from the group consisting of Asp, Glu, Arg, Lys and His with an uncharged
amino acid residue;

c) optionally repeating steps a) and b) recursively;

d) optionally, making alterations each of which is an insertion, a deletion or a substitution
of an amino acid residue at one or more positions other than b);

e) preparing the variant resulting from steps a) - d);

f) testing the stability of said variant; and

g) optionally repeating steps a) - f) recursively; and

h) selecting a subtilase variant having increased stability as compared to the parent
subtilase.

As will be understood by the skilled person it may also, in some cases, be advantageous to
substitute an uncharged amino acid residue with an amino acid residue bearing a charge
or, alternatively, it may in some cases be advantageous to substitute an amino acid residue
bearing a charge with an amino acid residue bearing a charge of opposite sign. Thus, the
above-mentioned method may easily be employed by the skilled person also for these
purposes. In the case of substituting an uncharged amino acid residue with an amino acid
residue bearing a charge, the above-mentioned method may be employed, the only
difference being steps a) and b), which will then read:

a) identifying, on the surface of the parent subtilase, at least one uncharged amino acid
residue;

b) substituting, on the surface of the parent subtilase, at least one uncharged amino acid
residue with a charged amino acid residue selected from the group consisting of Asp,
Glu, Arg, Lys and His.

Also in the case of changing the sign of an amino acid residue present on the surface of
the subtilase the above method may be employed. Again, compared to the above method,
the only difference is in steps a) and b) which, in this case, read:

a) identifying, on the surface of the parent subtilase, at least one charged amino acid
residue selected from the group consisting of Asp, Glu, Arg, Lys and His;

b) substituting, on the surface of the parent subtilase, at least one charged amino acid
residue identified in step a) with an amino acid residue having an opposite charge.

Thus, Asp may be substituted with Arg, Lys or His; Glu may be substituted with Arg, Lys or
His; Arg may be substituted with Asp or Glu; Lys may be substituted with Asp or Glu; and
His may be substituted with Asp or Glu.

In order to determine the amino acid residues of a subtilase which are present on the
surface of the enzyme, the surface accessible area is measured using the DSSP program
(Kabsch and Sander, Biopolymers (1983), 22, 2577-2637). All residues having a surface
accessibility higher than 0 are regarded as surface residues.
Amino acid residues found on the surface of JP170 using the above method are N76,
N316, L381, K246, K9, K313 and K83. We consider the substitutions N79D, N316D and
L381D of particular interest for stabilisation by introduction of salt bridges, whereas the
substitutions K246R, K9R, K313R and K83R are of particular interest for the stabilisation at
high pH.

Similar substitutions may be introduced in equivalent positions of other JP170 like
subtilases.
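The surface criterion (accessibility higher than 0) and the opposite-charge substitutions listed above can be combined in a few lines. The sketch below is illustrative only: the function names are hypothetical, and the accessibility values are invented placeholders rather than DSSP output for JP170.

```python
# Opposite-charge replacements as stated in the text.
OPPOSITE_CHARGE = {
    "Asp": ("Arg", "Lys", "His"),
    "Glu": ("Arg", "Lys", "His"),
    "Arg": ("Asp", "Glu"),
    "Lys": ("Asp", "Glu"),
    "His": ("Asp", "Glu"),
}

def surface_residues(accessibility):
    """Keys of residues whose accessible surface area is higher than 0."""
    return sorted(key for key, area in accessibility.items() if area > 0)

def charge_reversal_candidates(accessibility):
    """Map each charged surface residue to the residues of opposite charge."""
    return {key: OPPOSITE_CHARGE[key[1]]
            for key in surface_residues(accessibility)
            if key[1] in OPPOSITE_CHARGE}

# (position, residue name) -> accessible area; the numbers are made up.
example = {(9, "Lys"): 85.0, (30, "Asp"): 0.0, (76, "Asn"): 12.0, (246, "Lys"): 40.0}

print(surface_residues(example))
print(charge_reversal_candidates(example))
```

A practical workflow would more likely apply a non-zero accessibility threshold, but the sketch follows the criterion exactly as stated in the text.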

Substitution with proline residues 

Improved thermostability of a subtilase can be obtained by subjecting the subtilase in
question to analysis for secondary structure, identifying residues in the subtilase having
dihedral angles φ (phi) and ψ (psi) confined to the intervals [-90°<φ<-40° and -180°<ψ<180°],
preferably the intervals [-90°<φ<-40° and 120°<ψ<180°] or [-90°<φ<-40° and -50°<ψ<10°], and
excluding residues located in regions in which the subtilase is characterized by possessing
α-helical or β-sheet structure.

After the dihedral angles φ (phi) and ψ (psi) for the amino acids have been calculated, based
on the atomic structure in the crystalline subtilases, it is possible to select position(s) which
has/have dihedral phi and psi angles favorable for substitution with a proline residue. The
aliphatic side chain of proline residues is bonded covalently to the nitrogen atom of the peptide
group. The resulting cyclic five-membered ring consequently imposes a rigid constraint on the
rotation about the N-Cα bond of the peptide backbone and simultaneously prevents the
formation of hydrogen bonding to the backbone N-atom. For these structural reasons, proline
residues are generally not compatible with α-helical and β-sheet secondary conformations.
If a proline residue is not already at the identified position(s), the naturally occurring amino
acid residue is substituted with a proline residue, preferably by site directed mutagenesis
applied on a gene encoding the subtilase in question.
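The selection rule for proline substitution can be written as a small predicate. This is a sketch under stated assumptions: the function name and the secondary-structure labels ("helix", "sheet", "loop") are hypothetical, and the dihedral angles are assumed to be supplied in degrees by some other tool.

```python
def proline_candidate(phi, psi, secondary_structure, preferred_only=False):
    """Apply the phi/psi intervals given in the text to one residue.

    secondary_structure: 'helix', 'sheet' or any other label (assumed labels).
    preferred_only: restrict psi to the two preferred intervals.
    """
    if secondary_structure in ("helix", "sheet"):
        return False
    if not -90 < phi < -40:
        return False
    if preferred_only:
        return 120 < psi < 180 or -50 < psi < 10
    return -180 < psi < 180

print(proline_candidate(-60, 150, "loop", preferred_only=True))
print(proline_candidate(-60, 60, "loop", preferred_only=True))
print(proline_candidate(-60, 60, "loop"))
print(proline_candidate(-60, 150, "helix"))
```

Residues passing the predicate are only candidates; whether a given substitution is tolerated still has to be tested experimentally.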

In the group of JP170 like subtilases proline residues can advantageously be introduced at
positions 22, 44, 110, 139, 140, 166, 198, 201, 203, 231, 282, 356, 357 and 376.
Accordingly, a preferred JP170 variant has one or more of the substitutions: Q22P, E44P, L110P,
T139P, D140P, S166P, I198P, V201P, Q203P, S231P, S282P, S356P, T357P and K378P.
Especially preferred are variants comprising one or more of: E44P, Q203P and S356P.

Improved activity of JP170 subtilases

As mentioned, the JP170 subtilases differ greatly from the BPN' like subtilases in having a
long, apparently non-catalytic C-terminal. A possible truncation of JP170 is the removal of
approx. 115 residues including two ion-binding sites, which can be obtained by deletion of
or within the region 311-433, which is the non-catalytic C-terminal. Preferred deletions are
of the regions 317-433 or 315-433. Preferably the new C-terminal will be within the region
311-325. Further, the deletion can be optimised with additional substitutions, such as one
or more of L283N,Q; A290S,N and W306H,Y,K.
Preferred truncations comprise:

a) deletion of region 317-433 and the substitutions L283N + A290S + W306H,

b) deletion of region 315-433 and the substitutions L283N + A290S + W306H.

Substrate binding site

The substrate binding site is identified by the residues in contact with a substrate model,
such as the CI2 inhibitor. The 3D structure coordinates of the JP170 subtilase with CI2
bound in the active site can be found in Appendix 1. Without being limited to any theory, it
is presently believed that binding between a substrate and an enzyme is supported by
favorable interactions found within a sphere of 10 Å from the substrate molecule. Examples
of such favorable bonds are hydrogen bonds, strong electrostatic interactions and/or
hydrophobic interactions.

The following residues of the JP170 subtilase (SEQ ID NO:1) are within a distance of 10 Å
from the CI2 inhibitor, which is bound to the substrate binding site. These residues are thus
believed to be involved in interactions with said substrate:

29-32 (i.e. residues 29, 30, 31, 32),
64-72 (i.e. residues 64, 65, 66, 67, 68, 69, 70, 71, 72),
93,
96-98 (i.e. residues 96, 97, 98),
100-110 (i.e. residues 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110),
113-114,
127-136 (i.e. residues 127, 128, 129, 130, 131, 132, 133, 134, 135, 136),
138-141 (i.e. residues 138, 139, 140, 141),
144, 157, 174,
180-183 (i.e. residues 180, 181, 182, 183),
191, 193-194,
202-207 (i.e. residues 202, 203, 204, 205, 206, 207),
211,
223-226 (i.e. residues 223, 224, 225, 226),
234-241 (i.e. residues 234, 235, 236, 237, 238, 239, 240, 241),
249-258 (i.e. residues 249, 250, 251, 252, 253, 254, 255, 256, 257, 258).
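A residue list of this kind can be derived from coordinates with a simple distance cut-off. The sketch below is not the procedure used for the list above: it assumes that a residue counts as "within 10 Å" when any of its atoms lies within the cut-off of any inhibitor atom, and it uses invented coordinates and a hypothetical function name.

```python
import math

def residues_near(protein_atoms, ligand_atoms, cutoff=10.0):
    """Residue numbers with at least one atom within `cutoff` Å of any ligand atom.

    protein_atoms: iterable of (residue_number, (x, y, z))
    ligand_atoms:  iterable of (x, y, z)
    """
    ligand_atoms = list(ligand_atoms)
    near = set()
    for residue_number, xyz in protein_atoms:
        if any(math.dist(xyz, ligand_xyz) <= cutoff for ligand_xyz in ligand_atoms):
            near.add(residue_number)
    return sorted(near)

# Invented coordinates for illustration only.
protein = [
    (30, (0.0, 0.0, 0.0)),
    (68, (6.0, 0.0, 0.0)),
    (254, (0.0, 9.0, 0.0)),
    (400, (40.0, 0.0, 0.0)),
]
inhibitor = [(1.0, 1.0, 0.0), (2.0, 0.0, 1.0)]

print(residues_near(protein, inhibitor))
```

For a full structure a spatial index would be faster than this all-pairs loop, but the selection criterion would be the same.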

In an embodiment of the present invention a variant comprises a modification in one or
more of the above mentioned positions. A preferred variant is W129L.

JP170 with extra ion-binding site

The Strong ion-binding site from the BPN' subtilases can be transplanted into JP170 (or
other subtilases in the JP170 subgroup) by deletion of N79-N82 and subsequent insertion of
LNNSIGV, followed by the substitution A45D,N and optionally the substitutions E44P,T
and/or R47Q.



Removal of ion-binding site in JP170 

By removing an ion-binding site it is possible to decrease the enzymes dependency of cal- 
20 cium in the media. The ion-binding sites in JP170 (or others from JP170 group) can be re- 
moved with guidance from the three-dimensional structure of BPN' and Savinase (or others 
in BPN* group), and of TY145 like subtilases. 

Removal of ion-binding site 1 can be done by deletion of N186-N199 and subsequent
insertion of at least three amino acid residues, preferably the sequence SSN. Preferably, but
not mandatorily, one or both of the substitutions I7Q and V3Y are further added.
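The "deletion and subsequent insertion" operation described above can be sketched as a simple string operation on a one-letter amino acid sequence. The helper name `replace_segment` and the 20-residue toy sequence are hypothetical; the sketch does not use the JP170 sequence and does not model the numbering of SEQ ID NO:1.

```python
def replace_segment(seq, first, last, insert):
    """Delete residues first..last (1-based, inclusive) from `seq`
    and put `insert` in their place."""
    if not (1 <= first <= last <= len(seq)):
        raise ValueError("segment lies outside the sequence")
    return seq[:first - 1] + insert + seq[last:]

# Toy example: remove residues 5-9 of a made-up sequence and insert SSN.
toy = "ACDEFGHIKLMNPQRSTVWY"
variant = replace_segment(toy, 5, 9, "SSN")
print(variant)
```

In practice the positions would be given in the numbering of the parent subtilase, so a mapping from that numbering to string indices would be needed before calling such a helper.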

The ion-binding site 1 can be removed from a wild-type JP170 subtilase or a JP170 subti- 
lase truncated as described above. 



Subtilases free of ion-binding sites 
With guidance from the three-dimensional structure of JP170 like subtilases and of TY145
like subtilases, the Strong and Weak ion-binding sites in BPN' like subtilases can be
removed. Likewise, as described above, with guidance from the three-dimensional structure
of BPN' and Savinase (or others in the BPN' group), and of TY145 like subtilases, all three
ion-binding sites can be removed from the wild-type JP170 subtilase or from JP170 like
subtilases. The same approach can be used to remove the ion-binding sites from TY145 like
subtilases.

Exemplified in Savinase, the removal can be done by altering the loops A194-L196 and
L75-L82 either by a) insertion or deletion of a number of amino acid residues in the loops or
b) by deletion of the entire loop or part of the loop and subsequent insertion of a number of
residues from a corresponding loop of a JP170 or TY145 like subtilase.

Preferably the ion-binding sites of Savinase can be removed by either

i) deletion of or in the region A194-L196 (BPN' numbers) and insertion of three or more
residues chosen from JP170 positions P209-P217 and

deletion of or in the region L75-L82 (BPN' numbers) and insertion of at least one
residue chosen from TY145 positions H83-Y92 or

ii) deletion of or in the region A194-L196 (BPN' numbers) and insertion of three or more
residues chosen from JP170 positions P209-P217 and

deletion of or in the region L75-L82 (BPN' numbers) and insertion of at least one
residue chosen from JP170 positions N79-K83.

Combined modifications 

The present invention also encompasses any of the above mentioned subtilase variants in
combination with any other modification to the amino acid sequence thereof. Especially
combinations with other modifications known in the art to provide improved properties to
the enzyme are envisaged.

Such combinations comprise the positions: 222 (improves oxidation stability), 218
(improves thermal stability), substitutions in the Ca2+-binding sites stabilizing the enzyme, e.g.
position 76, and many others apparent from the prior art.

In further embodiments a subtilase variant described herein may advantageously be
combined with one or more modification(s) in any of the positions:

27, 36, 56, 76, 87, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 120, 123, 159, 167, 170,
206, 218, 222, 224, 232, 235, 236, 245, 248, 252 and 274 (BPN' numbering).

Specifically, the following BLSAVI, BLSUBL, BSKSMK, and BAALKP modifications are
considered appropriate for combination:

K27R, *36D, S56P, N76D, S87N, G97N, S101G, S103A, V104A, V104I, V104N, V104Y,
H120D, N123S, G159D, Y167, R176, Q206E, N218S, M222S, M222A, T224S, A232V,
K235L, Q236H, Q245R, N248D, N252K and T274A.


Furthermore variants comprising any of the modifications S101G+V104N,
S87N+S101G+V104N, K27R+V104Y+N123S+T274A, N76D+S103A+V104I or
N76D+V104A, or other combinations of the modifications K27R, N76D, S101G, S103A,
V104N, V104Y, V104I, V104A, N123S, G159D, A232V, Q236H, Q245R, N248D, N252K,
T274A in combination with any one or more of the modification(s) mentioned above exhibit
improved properties.

A particularly interesting variant is a variant which, in addition to modifications according to
the invention, contains the following substitutions:

S101G+S103A+V104I+G159D+A232V+Q236H+Q245R+N248D+N252K.
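The substitution notation used above (original residue, position, new residue, with "+" joining multiple substitutions) can be handled mechanically. The following is a minimal sketch; the function names `parse_variant` and `apply_variant` are hypothetical, and the sketch handles only simple substitutions, not insertions, deletions or alternatives such as A45D,N. The parent is represented as a mapping from position (in whatever numbering the notation uses) to residue.

```python
import re

# One substitution: original residue, position, new residue (e.g. S101G).
_SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")

def parse_variant(spec):
    """Split 'S101G+V104N' into [('S', 101, 'G'), ('V', 104, 'N')]."""
    mutations = []
    for token in spec.split("+"):
        match = _SUBSTITUTION.match(token.strip())
        if match is None:
            raise ValueError(f"not a simple substitution: {token!r}")
        mutations.append((match.group(1), int(match.group(2)), match.group(3)))
    return mutations

def apply_variant(residue_by_position, spec):
    """Return a copy of the parent mapping with the substitutions applied.
    Raises ValueError if the parent residue does not match the notation."""
    result = dict(residue_by_position)
    for old, position, new in parse_variant(spec):
        if result.get(position) != old:
            raise ValueError(f"parent has {result.get(position)!r} at {position}, expected {old!r}")
        result[position] = new
    return result

# Toy fragment of a parent (positions and residues chosen to fit the notation).
parent_fragment = {101: "S", 103: "S", 104: "V"}
print(apply_variant(parent_fragment, "S101G+V104N"))
```

Checking the parent residue before substituting catches numbering mismatches, for instance when a variant written in BPN' numbering is applied to a sequence indexed in its own numbering.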

Moreover, subtilase variants of the main aspect(s) of the invention are preferably combined
with one or more modification(s) in any of the positions 129, 131 and 194, preferably as
129K, 131H and 194P modifications, and most preferably as P129K, P131H and A194P
modifications. Any of those modification(s) are expected to provide a higher expression
level of the subtilase variant in the production thereof.


Stabilization by modification of Asn-Gly pairs 

It is known that at alkaline pH, the side chain of Asn may interact with the NH group of a
sequential neighbouring amino acid to form an isoAsp residue where the backbone goes
through the Asp side chain. This will leave the backbone more vulnerable to proteolysis.
The deamidation is much more likely to occur if the residue that follows is a Gly. Changing
the Asn in front of the Gly or the Gly will prevent this from happening and thus improve the
stability, especially as concerns thermo- and storage stability.

The invention consequently further relates to a subtilase, in which either or both residues of
any of the Asn-Gly sequences appearing in the amino acid sequence of the parent RP-II
protease is/are deleted or substituted with a residue of a different amino acid.
The Asn and/or Gly residue may, for instance, be substituted with a residue of an amino
acid selected from the group consisting of A, Q, S, P, T and Y.
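Locating candidate Asn-Gly pairs in a sequence is a simple scan. The sketch below assumes a one-letter amino acid string; the function names `asn_gly_pairs` and `substitute` and the toy sequence are hypothetical and do not correspond to any sequence of the invention.

```python
def asn_gly_pairs(seq):
    """Return the 1-based positions of every Asn (N) that is directly
    followed by a Gly (G)."""
    return [i + 1 for i in range(len(seq) - 1) if seq[i] == "N" and seq[i + 1] == "G"]

def substitute(seq, position, new_residue):
    """Replace the residue at the 1-based `position` with `new_residue`."""
    if not 1 <= position <= len(seq):
        raise ValueError("position outside the sequence")
    return seq[:position - 1] + new_residue + seq[position:]

toy = "MANGKLNGSNA"          # made-up sequence with two Asn-Gly pairs
sites = asn_gly_pairs(toy)    # positions of the Asn residues
stabilised = toy
for pos in sites:
    # Replace each Asn of an Asn-Gly pair with Ser, one of the listed residues.
    stabilised = substitute(stabilised, pos, "S")
print(sites, stabilised)
```

Because a substitution does not change the sequence length, the positions found in the first scan remain valid while the substitutions are applied one after another.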




10321.000-DK 



Modification of Tyrosine residues 

In relation to wash performance it has been found that the modification of certain tyrosine
residues to phenylalanine provides an improved wash performance. Without being bound by
any specific theory, it is believed that titration of these Tyr residues in the alkaline wash liquor
has negative effects that are alleviated by replacing the Tyr residues with other residues,
especially Phe or Trp, particularly Phe.

Methods of preparing JP170 like or BPN' like subtilase variants 

The subtilase variants, i.e. the JP170 and BPN' variants of the present invention, may be
produced by any known method within the art and the present invention also relates to
nucleic acid encoding a subtilase variant of the present invention, a DNA construct comprising
said nucleic acid and a host cell comprising said nucleic acid sequence.
In general naturally occurring proteins may be produced by culturing the organism
expressing the protein and subsequently purifying the protein, or they may be produced by cloning a
nucleic acid, e.g. genomic DNA or cDNA, encoding the protein into an expression vector,
introducing said expression vector into a host cell, culturing the host cell and purifying the
expressed protein.

Typically protein variants may be produced by site-directed mutagenesis of a parent
protein, introduction into expression vector, host cell etc. The parent protein may be cloned
from a strain producing the polypeptide or from an expression library, i.e. it may be isolated
from genomic DNA or prepared from cDNA, or a combination thereof.
In general standard procedures for cloning of genes and/or introducing mutations (random
and/or site directed) into said genes may be used in order to obtain a parent subtilase, or
subtilase or subtilase variant of the invention. For further description of suitable techniques
reference is made to Molecular cloning: A laboratory manual (Sambrook et al. (1989), Cold
Spring Harbor lab., Cold Spring Harbor, NY; Ausubel, F. M. et al. (eds.)); Current protocols
in Molecular Biology (John Wiley and Sons, 1995; Harwood, C. R., and Cutting, S. M.
(eds.)); Molecular Biological Methods for Bacillus (John Wiley and Sons, 1990); DNA
Cloning: A Practical Approach, Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide
Synthesis (M.J. Gait ed. 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds.
(1985)); Transcription And Translation (B.D. Hames & S.J. Higgins, eds. (1984)); Animal
Cell Culture (R.I. Freshney, ed. (1986)); Immobilized Cells And Enzymes (IRL Press,
(1986)); A Practical Guide To Molecular Cloning (B. Perbal, (1984)) and WO 96/34946.



Further, variants could be constructed by:

Random Mutagenesis

Random mutagenesis is suitably performed either as localized or region-specific random
mutagenesis in at least three parts of the gene translating to the amino acid sequence
shown in question, or within the whole gene.

The random mutagenesis of a DNA sequence encoding a parent subtilase may be conven- 
iently performed by use of any method known in the art. 

In relation to the above, a further aspect of the present invention relates to a method for
generating a variant of a parent subtilase, wherein the variant exhibits an altered property,
such as increased thermostability, increased stability at low pH and at low calcium
concentration, relative to the parent subtilase, the method comprising:

(a) subjecting a DNA sequence encoding the parent subtilase to random mutagenesis,

(b) expressing the mutated DNA sequence obtained in step (a) in a host cell, and

(c) screening for host cells expressing a subtilase variant which has an altered property
relative to the parent subtilase.

Step (a) of the above method of the invention is preferably performed using doped primers.
For instance, the random mutagenesis may be performed by use of a suitable physical or
chemical mutagenizing agent, by use of a suitable oligonucleotide, or by subjecting the
DNA sequence to PCR generated mutagenesis. Furthermore, the random mutagenesis
may be performed by use of any combination of these mutagenizing agents. The
mutagenizing agent may, e.g., be one which induces transitions, transversions, inversions,
scrambling, deletions, and/or insertions.

Examples of a physical or chemical mutagenizing agent suitable for the present purpose
include ultraviolet (UV) irradiation, hydroxylamine, N-methyl-N'-nitro-N-nitrosoguanidine
(MNNG), O-methyl hydroxylamine, nitrous acid, ethyl methane sulphonate (EMS), sodium
bisulphite, formic acid, and nucleotide analogues. When such agents are used, the
mutagenesis is typically performed by incubating the DNA sequence encoding the parent
enzyme to be mutagenized in the presence of the mutagenizing agent of choice under
suitable conditions for the mutagenesis to take place, and selecting for mutated DNA having
the desired properties.

When the mutagenesis is performed by the use of an oligonucleotide, the oligonucleotide
may be doped or spiked with the three non-parent nucleotides during the synthesis of the
oligonucleotide at the positions that are to be changed. The doping or spiking may be done
so that codons for unwanted amino acids are avoided. The doped or spiked oligonucleotide
can be incorporated into the DNA encoding the subtilase enzyme by any published
technique, using, e.g., PCR, LCR or any DNA polymerase and ligase as deemed appropriate.
Preferably, the doping is carried out using "constant random doping", in which the
percentage of wild-type and modification in each position is predefined. Furthermore, the doping
may be directed toward a preference for the introduction of certain nucleotides, and thereby
a preference for the introduction of one or more specific amino acid residues. The doping
may be made, e.g., so as to allow for the introduction of 90% wild type and 10%
modifications in each position. An additional consideration in the choice of a doping scheme is
based on genetic as well as protein-structural constraints. The doping scheme may be
made by using the DOPE program which, inter alia, ensures that introduction of stop
codons is avoided (L.J. Jensen et al., Nucleic Acids Research, 26, 697-702 (1998)).
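The "90% wild type and 10% modifications" scheme mentioned above can be illustrated with a small simulation. This is a sketch under stated assumptions, not the DOPE program: it assumes that the 10% is shared equally among the three non-parent nucleotides, and the function names `doping_distribution` and `synthesize_doped` are hypothetical.

```python
import random

BASES = "ACGT"

def doping_distribution(wild_type_base, wt_fraction=0.9):
    """Nucleotide probabilities at one doped position: `wt_fraction` for the
    wild-type base, the remainder split equally over the other three bases
    (the equal split is an assumption made for this sketch)."""
    other = (1.0 - wt_fraction) / 3
    return {b: (wt_fraction if b == wild_type_base else other) for b in BASES}

def synthesize_doped(oligo, doped_positions, wt_fraction=0.9, rng=None):
    """Simulate one molecule of a doped oligonucleotide.
    `doped_positions` holds 0-based indices that are doped; all other
    positions keep the wild-type base."""
    if rng is None:
        rng = random.Random()
    out = []
    for i, base in enumerate(oligo):
        if i in doped_positions:
            dist = doping_distribution(base, wt_fraction)
            out.append(rng.choices(BASES, weights=[dist[b] for b in BASES])[0])
        else:
            out.append(base)
    return "".join(out)

# Probability that a molecule with n doped positions is entirely wild type.
n_doped = 9
print(0.9 ** n_doped)
print(synthesize_doped("GCTTATGGT", set(range(9)), rng=random.Random(0)))
```

At 90% wild type per position, a stretch of nine doped positions remains completely unmodified in roughly 39% of the molecules (0.9 to the ninth power), which is one reason the doping level is chosen with the length of the region in mind.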
When PCR-generated mutagenesis is used, either a chemically treated or non-treated
gene encoding a parent subtilase enzyme is subjected to PCR under conditions that
increase the misincorporation of nucleotides (Deshler 1992; Leung et al., Technique, 1,
1989, pp. 11-15).

A mutator strain of E. coli (Fowler et al., Molec. Gen. Genet., 133, 1974, 179-191), S.
cerevisiae or any other microbial organism may be used for the random mutagenesis of the
DNA encoding the subtilase by, e.g., transforming a plasmid containing the parent enzyme
into the mutator strain, growing the mutator strain with the plasmid and isolating the
mutated plasmid from the mutator strain. The mutated plasmid may be subsequently
transformed into the expression organism.

The DNA sequence to be mutagenized may conveniently be present in a genomic or cDNA
library prepared from an organism expressing the parent subtilase. Alternatively, the DNA
sequence may be present on a suitable vector such as a plasmid or a bacteriophage,
which as such may be incubated with or otherwise exposed to the mutagenising agent. The
DNA to be mutagenized may also be present in a host cell either by being integrated in the
genome of said cell or by being present on a vector harbored in the cell. Finally, the DNA to
be mutagenized may be in isolated form. It will be understood that the DNA sequence to be
subjected to random mutagenesis is preferably a cDNA or a genomic DNA sequence.

In some cases it may be convenient to amplify the mutated DNA sequence prior to
performing the expression step b) or the screening step c). Such amplification may be
performed in accordance with methods known in the art, the presently preferred method being
PCR-generated amplification using oligonucleotide primers prepared on the basis of the
DNA or amino acid sequence of the parent enzyme.

Subsequent to the incubation with or exposure to the mutagenising agent, the mutated
DNA is expressed by culturing a suitable host cell carrying the DNA sequence under
conditions allowing expression to take place. The host cell used for this purpose may be one
which has been transformed with the mutated DNA sequence, optionally present on a
vector, or one which carried the DNA sequence encoding the parent enzyme during the
mutagenesis treatment. Examples of suitable host cells are the following: gram positive
bacteria such as Bacillus subtilis, Bacillus licheniformis, Bacillus lentus, Bacillus brevis,
Bacillus stearothermophilus, Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus
coagulans, Bacillus circulans, Bacillus lautus, Bacillus megaterium, Bacillus thuringiensis,
Streptomyces lividans or Streptomyces murinus; and gram negative bacteria such as E. coli.

The mutated DNA sequence may further comprise a DNA sequence encoding functions
permitting expression of the mutated DNA sequence.



Localised random mutagenesis 

The random mutagenesis may be advantageously localised to a part of the parent subtilase
in question. This may, e.g., be advantageous when certain regions of the enzyme have
been identified to be of particular importance for a given property of the enzyme, and when
modified are expected to result in a variant having improved properties. Such regions may
normally be identified when the tertiary structure of the parent enzyme has been elucidated
and related to the function of the enzyme.

The localised or region-specific, random mutagenesis is conveniently performed by use of
PCR generated mutagenesis techniques as described above or any other suitable
technique known in the art. Alternatively, the DNA sequence encoding the part of the DNA
sequence to be modified may be isolated, e.g., by insertion into a suitable vector, and said
part may be subsequently subjected to mutagenesis by use of any of the mutagenesis
methods discussed above.



General method for random mutagenesis by use of the DOPE program
The random mutagenesis may be carried out by the following steps:
1. Select regions of interest for modification in the parent enzyme

2. Decide on mutation sites and non-mutated sites in the selected region

3. Decide on which kind of mutations should be carried out, e.g. with respect to the
desired stability and/or performance of the variant to be constructed

4. Select structurally reasonable mutations

5. Adjust the residues selected by step 3 with regard to step 4.

6. Analyse by use of a suitable dope algorithm the nucleotide distribution.

7. If necessary, adjust the wanted residues to genetic code realism, e.g. taking into
account constraints resulting from the genetic code, e.g. in order to avoid
introduction of stop codons; the skilled person will be aware that some codon combinations
cannot be used in practice and will need to be adapted

8. Make primers

9. Perform random mutagenesis by use of the primers

10. Select resulting subtilase variants by screening for the desired improved properties.

Suitable dope algorithms for use in step 6 are well known in the art. One such algorithm is
described by Tomandl, D. et al., 1997, Journal of Computer-Aided Molecular Design
11:29-38. Another algorithm is DOPE (Jensen, LJ, Andersen, KV, Svendsen, A, and
Kretzschmar, T (1998) Nucleic Acids Research 26:697-702).
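Steps 6 and 7 above amount to translating a nucleotide distribution at each codon position into an amino acid distribution and checking it for unwanted outcomes such as stop codons. The following minimal sketch shows that calculation for a single codon using the standard genetic code. It is not the DOPE program or the Tomandl algorithm; the function name `amino_acid_distribution` and the example distributions are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def amino_acid_distribution(doped_codon):
    """Given three dicts (one per codon position) mapping base -> probability,
    return a dict mapping amino acid (or '*') -> probability."""
    dist = {}
    for b1, b2, b3 in product(BASES, repeat=3):
        p = (doped_codon[0].get(b1, 0.0)
             * doped_codon[1].get(b2, 0.0)
             * doped_codon[2].get(b3, 0.0))
        if p:
            aa = CODON_TABLE[b1 + b2 + b3]
            dist[aa] = dist.get(aa, 0.0) + p
    return dist

# Wild-type codon TAT (Tyr) with only the third position doped.
risky = [{"T": 1.0}, {"A": 1.0}, {"T": 0.9, "A": 0.05, "G": 0.05}]
safe = [{"T": 1.0}, {"A": 1.0}, {"T": 0.9, "C": 0.1}]
print(amino_acid_distribution(risky))  # includes '*' because TAA and TAG are stop codons
print(amino_acid_distribution(safe))   # TAT and TAC both encode Tyr, no stop codon
```

In the first scheme 10% of the molecules would carry a stop codon at this position; restricting the third position to T and C removes the stop codons, which is the kind of adjustment step 7 refers to.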

Expression vectors

A recombinant expression vector comprising a nucleic acid sequence encoding a subtilase
variant of the invention may be any vector that may conveniently be subjected to
recombinant DNA procedures and which may bring about the expression of the nucleic acid
sequence.

The choice of vector will often depend on the host cell into which it is to be introduced.
Examples of a suitable vector include a linear or closed circular plasmid or a virus. The vector
may be an autonomously replicating vector, i.e., a vector which exists as an
extra-chromosomal entity, the replication of which is independent of chromosomal replication,
e.g., a plasmid, an extra-chromosomal element, a mini chromosome, or an artificial
chromosome. The vector may contain any means for assuring self-replication. Examples of
bacterial origins of replication are the origins of replication of plasmids pBR322, pUC19,
pACYC177, pACYC184, pUB110, pE194, pTA1060, and pAMβ1. Examples of origins of
replication for use in a yeast host cell are the 2 micron origin of replication, the
combination of CEN6 and ARS4, and the combination of CEN3 and ARS1. The origin of replication
may be one having a mutation which makes it function as temperature-sensitive in the host
cell (see, e.g., Ehrlich, 1978, Proceedings of the National Academy of Sciences USA
75:1433).

Alternatively, the vector may be one which, when introduced into the host cell, is integrated
into the genome and replicated together with the chromosome(s) into which it has been
integrated. Vectors which are integrated into the genome of the host cell may contain any
nucleic acid sequence enabling integration into the genome, in particular it may contain
nucleic acid sequences facilitating integration into the genome by homologous or
non-homologous recombination. The vector system may be a single vector, e.g. plasmid or
virus, or two or more vectors, e.g. plasmids or viruses, which together contain the total DNA to
be introduced into the genome of the host cell, or a transposon.

The vector may in particular be an expression vector in which the DNA sequence encoding
the subtilase variant of the invention is operably linked to additional segments or control
sequences required for transcription of the DNA. The term "operably linked" indicates that
the segments are arranged so that they function in concert for their intended purposes, e.g.
transcription initiates in a promoter and proceeds through the DNA sequence encoding the
subtilase variant. Additional segments or control sequences include a promoter, a leader, a
polyadenylation sequence, a propeptide sequence, a signal sequence and a transcription
terminator. At a minimum the control sequences include a promoter and transcriptional and
translational stop signals.

The promoter may be any DNA sequence that shows transcriptional activity in the host cell
of choice and may be derived from genes encoding proteins either homologous or
heterologous to the host cell.

Examples of suitable promoters for use in bacterial host cells include the promoter of the
Bacillus subtilis levansucrase gene (sacB), the Bacillus stearothermophilus maltogenic
amylase gene (amyM), the Bacillus licheniformis alpha-amylase gene (amyL), the Bacillus
amyloliquefaciens alpha-amylase gene (amyQ), the Bacillus subtilis alkaline protease
gene, or the Bacillus pumilus xylosidase gene, the Bacillus amyloliquefaciens BAN
amylase gene, the Bacillus licheniformis penicillinase gene (penP), the Bacillus subtilis xylA
and xylB genes, and the prokaryotic beta-lactamase gene (Villa-Kamaroff et al., 1978,
Proceedings of the National Academy of Sciences USA 75:3727-3731). Other examples
include the phage Lambda PR or PL promoters or the E. coli lac, trp or tac promoters or the
Streptomyces coelicolor agarase gene (dagA). Further promoters are described in "Useful
proteins from recombinant bacteria" in Scientific American, 1980, 242:74-94; and in
Sambrook et al., 1989, supra.

Examples of suitable promoters for use in a filamentous fungal host cell are promoters
obtained from the genes encoding Aspergillus oryzae TAKA amylase, Rhizomucor miehei
aspartic proteinase, Aspergillus niger neutral alpha-amylase, Aspergillus niger acid stable
alpha-amylase, Aspergillus niger or Aspergillus awamori glucoamylase (glaA), Rhizomucor
miehei lipase, Aspergillus oryzae alkaline protease, Aspergillus oryzae triose phosphate
isomerase, Aspergillus nidulans acetamidase, Fusarium oxysporum trypsin-like protease
(as described in U.S. Patent No. 4,288,627, which is incorporated herein by reference), and
hybrids thereof. Particularly preferred promoters for use in filamentous fungal host cells are
the TAKA amylase, NA2-tpi (a hybrid of the promoters from the genes encoding Aspergillus
niger neutral alpha-amylase and Aspergillus oryzae triose phosphate isomerase), and glaA
promoters. Further suitable promoters for use in filamentous fungus host cells are the
ADH3 promoter (McKnight et al., The EMBO J. 4 (1985), 2093 - 2099) or the tpiA promoter.
Examples of suitable promoters for use in yeast host cells include promoters from yeast
glycolytic genes (Hitzeman et al., J. Biol. Chem. 255 (1980), 12073 - 12080; Alber and
Kawasaki, J. Mol. Appl. Gen. 1 (1982), 419 - 434) or alcohol dehydrogenase genes (Young et
al., in Genetic Engineering of Microorganisms for Chemicals (Hollaender et al., eds.),
Plenum Press, New York, 1982), or the TPI1 (US 4,599,311) or ADH2-4C (Russell et al.,
Nature 304 (1983), 652 - 654) promoters.

Further useful promoters are obtained from the Saccharomyces cerevisiae enolase
(ENO-1) gene, the Saccharomyces cerevisiae galactokinase gene (GAL1), the Saccharomyces
cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase genes
(ADH2/GAP), and the Saccharomyces cerevisiae 3-phosphoglycerate kinase gene. Other
useful promoters for yeast host cells are described by Romanos et al., 1992, Yeast
8:423-488. In a mammalian host cell, useful promoters include viral promoters such as those
from Simian Virus 40 (SV40), Rous sarcoma virus (RSV), adenovirus, and bovine
papilloma virus (BPV).

Examples of suitable promoters for use in mammalian cells are the SV40 promoter
(Subramani et al., Mol. Cell Biol. 1 (1981), 854 - 864), the MT-1 (metallothionein gene)
promoter (Palmiter et al., Science 222 (1983), 809 - 814) or the adenovirus 2 major late
promoter.

Examples of suitable promoters for use in insect cells are the polyhedrin promoter (US
4,745,051; Vasuvedan et al., FEBS Lett. 311, (1992) 7 - 11), the P10 promoter (J.M. Vlak
et al., J. Gen. Virology 69, 1988, pp. 765-776), the Autographa californica polyhedrosis
virus basic protein promoter (EP 397 485), the baculovirus immediate early gene 1 promoter
(US 5,155,037; US 5,162,222), or the baculovirus 39K delayed-early gene promoter (US
5,155,037; US 5,162,222).

The DNA sequence encoding a subtilase variant of the invention may also, if necessary, be 
operably connected to a suitable terminator. 

The recombinant vector of the invention may further comprise a DNA sequence enabling
the vector to replicate in the host cell in question.

The vector may also comprise a selectable marker, e.g. a gene the product of which
complements a defect in the host cell, or a gene encoding resistance to e.g. antibiotics like
ampicillin, kanamycin, chloramphenicol, erythromycin, tetracycline, spectinomycin, neomycin,
hygromycin, methotrexate, or resistance to heavy metals, virus or herbicides, or which
provides for prototrophy or auxotrophs. Examples of bacterial selectable markers are the dal
genes from Bacillus subtilis or Bacillus licheniformis, resistance. A frequently used
mammalian marker is the dihydrofolate reductase gene (DHFR). Suitable markers for yeast
host cells are ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and URA3. A selectable marker for
use in a filamentous fungal host cell may be selected from the group including, but not
limited to, amdS (acetamidase), argB (ornithine carbamoyltransferase), bar (phosphinothricin
acetyltransferase), hygB (hygromycin phosphotransferase), niaD (nitrate reductase), pyrG
(orotidine-5'-phosphate decarboxylase), sC (sulfate adenyltransferase), trpC (anthranilate
synthase), and glufosinate resistance markers, as well as equivalents from other species.
Particularly preferred for use in an Aspergillus cell are the amdS and pyrG markers of Aspergillus
nidulans or Aspergillus oryzae and the bar marker of Streptomyces hygroscopicus.
Furthermore, selection may be accomplished by co-transformation, e.g., as described in WO
91/17243, where the selectable marker is on a separate vector.

To direct a subtilase variant of the present invention into the secretory pathway of the host
cells, a secretory signal sequence (also known as a leader sequence, prepro sequence or
pre sequence) may be provided in the recombinant vector. The secretory signal sequence
is joined to the DNA sequence encoding the enzyme in the correct reading frame.
Secretory signal sequences are commonly positioned 5' to the DNA sequence encoding the
enzyme. The secretory signal sequence may be that normally associated with the enzyme or
may be from a gene encoding another secreted protein.

The procedures used to ligate the DNA sequences coding for the present enzyme, the
promoter and optionally the terminator and/or secretory signal sequence, respectively, or to
assemble these sequences by suitable PCR amplification schemes, and to insert them into
suitable vectors containing the information necessary for replication or integration, are well
known to persons skilled in the art (cf., for instance, Sambrook et al.).

More than one copy of a nucleic acid sequence encoding an enzyme of the present
invention may be inserted into the host cell to amplify expression of the nucleic acid sequence.
Stable amplification of the nucleic acid sequence can be obtained by integrating at least
one additional copy of the sequence into the host cell genome using methods well known in
the art and selecting for transformants.

The nucleic acid constructs of the present invention may also comprise one or more nucleic
acid sequences which encode one or more factors that are advantageous in the expression
of the polypeptide, e.g., an activator (e.g., a trans-acting factor), a chaperone, and a
processing protease. Any factor that is functional in the host cell of choice may be used in the
present invention. The nucleic acids encoding one or more of these factors are not
necessarily in tandem with the nucleic acid sequence encoding the polypeptide.

Host cells 

The DNA sequence encoding a subtilase variant of the present invention may be either
homologous or heterologous to the host cell into which it is introduced. If homologous to
the host cell, i.e. produced by the host cell in nature, it will typically be operably connected
to another promoter sequence or, if applicable, another secretory signal sequence and/or
terminator sequence than in its natural environment. The term "homologous" is intended to
include a DNA sequence encoding an enzyme native to the host organism in question. The
term "heterologous" is intended to include a DNA sequence not expressed by the host cell
in nature. Thus, the DNA sequence may be from another organism, or it may be a synthetic
sequence.

The host cell into which the DNA construct or the recombinant vector of the invention is
introduced may be any cell that is capable of producing the present subtilase variants, such
as prokaryotes, e.g. bacteria, or eukaryotes, such as fungal cells, e.g. yeasts or filamentous
fungi, insect cells, plant cells or mammalian cells.

Examples of bacterial host cells which, on cultivation, are capable of producing the
subtilase variants of the invention are gram-positive bacteria such as strains of Bacillus, e.g.
strains of B. subtilis, B. licheniformis, B. lentus, B. brevis, B. stearothermophilus, B.
alkalophilus, B. amyloliquefaciens, B. coagulans, B. circulans, B. lautus, B. megaterium or B.
thuringiensis, or strains of Streptomyces, such as S. lividans or S. murinus, or
gram-negative bacteria such as Escherichia coli or Pseudomonas sp.

The transformation of the bacteria may be effected by protoplast transformation,
electroporation, conjugation, or by using competent cells in a manner known per se (cf. Sambrook et
al., supra).

When expressing the subtilase variant in bacteria such as E. coli, the enzyme may be
retained in the cytoplasm, typically as insoluble granules (known as inclusion bodies), or it
may be directed to the periplasmic space by a bacterial secretion sequence. In the former
case, the cells are lysed and the granules are recovered and denatured after which the
enzyme is refolded by diluting the denaturing agent. In the latter case, the enzyme may be
recovered from the periplasmic space by disrupting the cells, e.g. by sonication or osmotic
shock, to release the contents of the periplasmic space and recovering the enzyme.
When expressing the subtilase variant in gram-positive bacteria such as Bacillus or
Streptomyces strains, the enzyme may be retained in the cytoplasm, or it may be directed to the
extracellular medium by a bacterial secretion sequence. In the latter case, the enzyme may
be recovered from the medium as described below.

Examples of host yeast cells include cells of a species of Candida, Kluyveromyces,
Saccharomyces, Schizosaccharomyces, Pichia, Hansenula, or Yarrowia. In a particular
embodiment, the yeast host cell is a Saccharomyces carlsbergensis, Saccharomyces
cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri,
Saccharomyces norbensis or Saccharomyces oviformis cell. Other useful yeast host cells
are a Kluyveromyces lactis, Kluyveromyces fragilis, Hansenula polymorpha, Pichia
pastoris, Yarrowia lipolytica, Schizosaccharomyces pombe, Ustilago maydis, Candida maltosa,
Pichia guillermondii and Pichia methanolica cell (cf. Gleeson et al., J. Gen. Microbiol. 132,
1986, pp. 3459-3465; US 4,882,279 and US 4,879,231). Since the classification of yeast
may change in the future, for the purposes of this invention, yeast shall be defined as
described in Biology and Activities of Yeast (Skinner, F.A., Passmore, S.M., and Davenport,
R.R., eds, Soc. App. Bacteriol. Symposium Series No. 9, 1980). The biology of yeast and
manipulation of yeast genetics are well known in the art (see, e.g., Biochemistry and
Genetics of Yeast, Bacil, M., Horecker, B.J., and Stopani, A.O.M., editors, 2nd edition, 1987;
The Yeasts, Rose, A.H., and Harrison, J.S., editors, 2nd edition, 1987; and The Molecular
Biology of the Yeast Saccharomyces, Strathern et al., editors, 1981). Yeast may be
transformed using the procedures described by Becker and Guarente, In Abelson, J.N. and
Simon, M.I., editors, Guide to Yeast Genetics and Molecular Biology, Methods in
Enzymology, Volume 194, pp 182-187, Academic Press, Inc., New York; Ito et al., 1983, Journal of
Bacteriology 153:163; and Hinnen et al., 1978, Proceedings of the National Academy of
Sciences USA 75:1920.

Examples of filamentous fungal cells include filamentous forms of the subdivision
Eumycota and Oomycota (as defined by Hawksworth et al., 1995, supra); in particular it may
be a cell of a species of Acremonium, such as A. chrysogenum, Aspergillus, such as A.
awamori, A. foetidus, A. japonicus, A. niger, A. nidulans or A. oryzae, Fusarium, such as F.
bactridioides, F. cerealis, F. crookwellense, F. culmorum, F. graminearum, F. graminum, F.
heterosporum, F. negundi, F. reticulatum, F. roseum, F. sambucinum, F. sarcochroum, F.
sulphureum, F. trichothecioides or F. oxysporum, Humicola, such as H. insolens or H.
lanuginosa, Mucor, such as M. miehei, Myceliophthora, such as M. thermophilum,
Neurospora, such as N. crassa, Penicillium, such as P. purpurogenum, Thielavia, such as T.
terrestris, Tolypocladium, or Trichoderma, such as T. harzianum, T. koningii, T.
longibrachiatum, T. reesei or T. viride, or a teleomorph or synonym thereof. The use of
Aspergillus spp. for the expression of proteins is described in, e.g., EP 272 277 and
EP 230 023.

Examples of insect cells include a Lepidoptera cell line, such as Spodoptera frugiperda
cells or Trichoplusia ni cells (cf. US 5,077,214). Culture conditions may suitably be as
described in WO 89/01029 or WO 89/01028. Transformation of insect cells and production of
heterologous polypeptides therein may be performed as described in US 4,745,051;
US 4,775,624; US 4,879,236; US 5,155,037; US 5,162,222; and EP 397,485.

Examples of mammalian cells include Chinese hamster ovary (CHO) cells, HeLa cells,
baby hamster kidney (BHK) cells, COS cells, or any number of other immortalized cell lines
available, e.g., from the American Type Culture Collection. Methods of transfecting
mammalian cells and expressing DNA sequences introduced in the cells are described in e.g.
Kaufman and Sharp, J. Mol. Biol. 159 (1982), 601-621; Southern and Berg, J. Mol. Appl.
Genet. 1 (1982), 327-341; Loyter et al., Proc. Natl. Acad. Sci. USA 79 (1982), 422-426;
Wigler et al., Cell 14 (1978), 725; Corsaro and Pearson, Somatic Cell Genetics 7 (1981),
603; Ausubel et al., Current Protocols in Molecular Biology, John Wiley and Sons, Inc.,
N.Y., 1987; Hawley-Nelson et al., Focus 15 (1993), 73; Ciccarone et al., Focus 15 (1993),
80; Graham and van der Eb, Virology 52 (1973), 456; and Neumann et al., EMBO J. 1
(1982), 841-845. Mammalian cells may be transfected by direct uptake using the calcium
phosphate precipitation method of Graham and van der Eb (1973, Virology 52:456).

Methods for expression and isolation of proteins 

To express an enzyme of the present invention, the above-mentioned host cells
transformed or transfected with a vector comprising a nucleic acid sequence encoding an
enzyme of the present invention are typically cultured in a suitable nutrient medium under
conditions permitting the production of the desired molecules, after which these are
recovered from the cells or the culture broth.

The medium used to culture the host cells may be any conventional medium suitable for
growing the host cells, such as minimal or complex media containing appropriate
supplements. Suitable media are available from commercial suppliers or may be prepared
according to published recipes (e.g. in catalogues of the American Type Culture Collection).
The media may be prepared using procedures known in the art (see, e.g., references for
bacteria and yeast; Bennett, J.W. and LaSure, L., editors, More Gene Manipulations in Fungi,
Academic Press, CA, 1991).


If the enzymes of the present invention are secreted into the nutrient medium, they may be
recovered directly from the medium. If they are not secreted, they may be recovered from
cell lysates. The enzymes of the present invention may be recovered from the culture
medium by conventional procedures including separating the host cells from the medium by
centrifugation or filtration, precipitating the proteinaceous components of the supernatant or
filtrate by means of a salt, e.g. ammonium sulphate, and purification by a variety of
chromatographic procedures, e.g. ion exchange chromatography, gel filtration
chromatography, affinity chromatography, or the like, dependent on the enzyme in question.

The enzymes of the invention may be detected using methods known in the art that are
specific for these proteins. These detection methods include use of specific antibodies,
formation of a product, or disappearance of a substrate. For example, an enzyme assay
may be used to determine the activity of the molecule. Procedures for determining various
kinds of activity are known in the art.

The enzymes of the present invention may be purified by a variety of procedures known in
the art including, but not limited to, chromatography (e.g., ion exchange, affinity,
hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e.g.,
preparative isoelectric focusing (IEF)), differential solubility (e.g., ammonium sulfate
precipitation), or extraction (see, e.g., Protein Purification, J-C Janson and Lars Ryden,
editors, VCH Publishers, New York, 1989).

When an expression vector comprising a DNA sequence encoding an enzyme of the
present invention is transformed/transfected into a heterologous host cell it is possible to
enable heterologous recombinant production of the enzyme. An advantage of using a
heterologous host cell is that it is possible to make a highly purified enzyme composition,
characterized in being free from homologous impurities, which are often present when a
protein or peptide is expressed in a homologous host cell. In this context, homologous
impurities means any impurity (e.g. polypeptides other than the enzyme of the invention)
which originates from the homologous cell from which the enzyme of the invention is
originally obtained.

DETERGENT APPLICATIONS

The enzyme of the invention may be added to and thus become a component of a deter- 
gent composition. 

The detergent composition of the invention may for example be formulated as a hand or
machine laundry detergent composition including a laundry additive composition suitable
for pre-treatment of stained fabrics and a rinse added fabric softener composition, or be
formulated as a detergent composition for use in general household hard surface cleaning
operations, or be formulated for hand or machine dishwashing operations.

In a specific aspect, the invention provides a detergent additive comprising the enzyme of
the invention. The detergent additive as well as the detergent composition may comprise
one or more other enzymes such as a protease, a lipase, a cutinase, an amylase, a
carbohydrase, a cellulase, a pectinase, a mannanase, an arabinase, a galactanase, a xylanase,
an oxidase, e.g., a laccase, and/or a peroxidase.

In general the properties of the chosen enzyme(s) should be compatible with the selected
detergent (i.e. pH-optimum, compatibility with other enzymatic and non-enzymatic
ingredients, etc.), and the enzyme(s) should be present in effective amounts.

Proteases: Suitable proteases include those of animal, vegetable or microbial origin.
Microbial origin is preferred. Chemically modified or protein engineered mutants are included.
The protease may be a serine protease or a metallo protease, preferably an alkaline
microbial protease or a trypsin-like protease. Examples of alkaline proteases are subtilisins,
especially those derived from Bacillus, e.g., subtilisin Novo, subtilisin Carlsberg, subtilisin
309, subtilisin 147 and subtilisin 168 (described in WO 89/06279). Examples of trypsin-like
proteases are trypsin (e.g. of porcine or bovine origin) and the Fusarium protease
described in WO 89/06270 and WO 94/25583.

Examples of useful proteases are the variants described in WO 92/19729, WO 98/20115,
WO 98/20116, and WO 98/34946, especially the variants with substitutions in one or more
of the following positions: 27, 36, 57, 76, 87, 97, 101, 104, 120, 123, 167, 170, 194, 206,
218, 222, 224, 235 and 274.

Preferred commercially available protease enzymes include Alcalase™, Savinase™,
Primase™, Duralase™, Esperase™, and Kannase™ (Novo Nordisk A/S), Maxatase™,
Maxacal™, Maxapem™, Properase™, Purafect™, Purafect OxP™, FN2™, and FN3™
(Genencor International Inc.).

Lipases: Suitable lipases include those of bacterial or fungal origin. Chemically modified or
protein engineered mutants are included. Examples of useful lipases include lipases from
Humicola (synonym Thermomyces), e.g. from H. lanuginosa (T. lanuginosus) as described
in EP 258 068 and EP 305 216 or from H. insolens as described in WO 96/13580, a
Pseudomonas lipase, e.g. from P. alcaligenes or P. pseudoalcaligenes (EP 218 272), P.
cepacia (EP 331 376), P. stutzeri (GB 1,372,034), P. fluorescens, Pseudomonas sp. strain
SD 705 (WO 95/06720 and WO 96/27002), P. wisconsinensis (WO 96/12012), a Bacillus
lipase, e.g. from B. subtilis (Dartois et al. (1993), Biochimica et Biophysica Acta, 1131,
253-360), B. stearothermophilus (JP 64/744992) or B. pumilus (WO 91/16422).

Other examples are lipase variants such as those described in WO 92/05249, WO
94/01541, EP 407 225, EP 260 105, WO 95/35381, WO 96/00292, WO 95/30744, WO
94/25578, WO 95/14783, WO 95/22615, WO 97/04079 and WO 97/07202.

Preferred commercially available lipase enzymes include Lipolase™ and Lipolase Ultra™
(Novo Nordisk A/S).

Amylases: Suitable amylases (α and/or β) include those of bacterial or fungal origin.
Chemically modified or protein engineered mutants are included. Amylases include, for
example, α-amylases obtained from Bacillus, e.g. a special strain of B. licheniformis,
described in more detail in GB 1,296,839.

Examples of useful amylases are the variants described in WO 94/02597, WO 94/18314,
WO 96/23873, and WO 97/43424, especially the variants with substitutions in one or more
of the following positions: 15, 23, 105, 106, 124, 128, 133, 154, 156, 181, 188, 190, 197,
202, 208, 209, 243, 264, 304, 305, 391, 408, and 444.

Commercially available amylases are Duramyl™, Termamyl™, Fungamyl™ and BAN™
(Novo Nordisk A/S), Rapidase™ and Purastar™ (from Genencor International Inc.).

Cellulases: Suitable cellulases include those of bacterial or fungal origin. Chemically
modified or protein engineered mutants are included. Suitable cellulases include cellulases
from the genera Bacillus, Pseudomonas, Humicola, Fusarium, Thielavia, Acremonium, e.g. the
fungal cellulases produced from Humicola insolens, Myceliophthora thermophila and
Fusarium oxysporum disclosed in US 4,435,307, US 5,648,263, US 5,691,178, US 5,776,757
and WO 89/09259.

Especially suitable cellulases are the alkaline or neutral cellulases having colour care
benefits. Examples of such cellulases are cellulases described in EP 0 495 257, EP 0 531 372,
WO 96/11262, WO 96/29397, WO 98/08940. Other examples are cellulase variants such
as those described in WO 94/07998, EP 0 531 315, US 5,457,046, US 5,686,593, US
5,763,254, WO 95/24471, WO 98/12307 and PCT/DK98/00299.

Commercially available cellulases include Celluzyme™ and Carezyme™ (Novo Nordisk
A/S), Clazinase™ and Puradax HA™ (Genencor International Inc.), and KAC-500(B)™
(Kao Corporation).

Peroxidases/Oxidases: Suitable peroxidases/oxidases include those of plant, bacterial or
fungal origin. Chemically modified or protein engineered mutants are included. Examples of
useful peroxidases include peroxidases from Coprinus, e.g. from C. cinereus, and variants
thereof such as those described in WO 93/24618, WO 95/10602, and WO 98/15257.

Commercially available peroxidases include Guardzyme™ (Novo Nordisk A/S).

The detergent enzyme(s) may be included in a detergent composition by adding separate
additives containing one or more enzymes, or by adding a combined additive comprising all
of these enzymes. A detergent additive of the invention, i.e. a separate additive or a
combined additive, can be formulated e.g. as a granulate, a liquid, a slurry, etc. Preferred
detergent additive formulations are granulates, in particular non-dusting granulates, liquids,
in particular stabilized liquids, or slurries.

Non-dusting granulates may be produced, e.g., as disclosed in US 4,106,991 and
4,661,452 and may optionally be coated by methods known in the art. Examples of waxy
coating materials are poly(ethylene oxide) products (polyethyleneglycol, PEG) with mean
molar weights of 1000 to 20000; ethoxylated nonylphenols having from 16 to 50 ethylene
oxide units; ethoxylated fatty alcohols in which the alcohol contains from 12 to 20 carbon
atoms and in which there are 15 to 80 ethylene oxide units; fatty alcohols; fatty acids; and
mono-, di- and triglycerides of fatty acids. Examples of film-forming coating materials
suitable for application by fluid bed techniques are given in GB 1483591. Liquid enzyme
preparations may, for instance, be stabilized by adding a polyol such as propylene glycol, a
sugar or sugar alcohol, lactic acid or boric acid according to established methods.
Protected enzymes may be prepared according to the method disclosed in EP 238,216.

The detergent composition of the invention may be in any convenient form, e.g., a bar, a
tablet, a powder, a granule, a paste or a liquid. A liquid detergent may be aqueous, typically
containing up to 70% water and 0-30% organic solvent, or non-aqueous.

The detergent composition comprises one or more surfactants, which may be non-ionic
including semi-polar and/or anionic and/or cationic and/or zwitterionic. The surfactants are
typically present at a level of from 0.1% to 60% by weight.

When included therein the detergent will usually contain from about 1% to about 40% of an
anionic surfactant such as linear alkylbenzenesulfonate, alpha-olefinsulfonate, alkyl sulfate
(fatty alcohol sulfate), alcohol ethoxysulfate, secondary alkanesulfonate, alpha-sulfo fatty
acid methyl ester, alkyl- or alkenylsuccinic acid or soap.




When included therein the detergent will usually contain from about 0.2% to about 40% of a
non-ionic surfactant such as alcohol ethoxylate, nonylphenol ethoxylate, alkylpolyglycoside,
alkyldimethylamineoxide, ethoxylated fatty acid monoethanolamide, fatty acid
monoethanolamide, polyhydroxy alkyl fatty acid amide, or N-acyl N-alkyl derivatives of
glucosamine ("glucamides").

The detergent may contain 0-65% of a detergent builder or complexing agent such as
zeolite, diphosphate, triphosphate, phosphonate, carbonate, citrate, nitrilotriacetic acid,
ethylenediaminetetraacetic acid, diethylenetriaminepentaacetic acid, alkyl- or alkenylsuccinic
acid, soluble silicates or layered silicates (e.g. SKS-6 from Hoechst).

The detergent may comprise one or more polymers. Examples are carboxymethylcellulose,
poly(vinylpyrrolidone), poly(ethylene glycol), poly(vinyl alcohol), poly(vinylpyridine-N-oxide),
poly(vinylimidazole), polycarboxylates such as polyacrylates, maleic/acrylic acid
copolymers and lauryl methacrylate/acrylic acid copolymers.

The detergent may contain a bleaching system which may comprise a H2O2 source such as
perborate or percarbonate which may be combined with a peracid-forming bleach activator
such as tetraacetylethylenediamine or nonanoyloxybenzenesulfonate. Alternatively, the
bleaching system may comprise peroxyacids of e.g. the amide, imide, or sulfone type.

The enzyme(s) of the detergent composition of the invention may be stabilized using
conventional stabilizing agents, e.g., a polyol such as propylene glycol or glycerol, a sugar or
sugar alcohol, lactic acid, boric acid, or a boric acid derivative, e.g., an aromatic borate
ester, or a phenyl boronic acid derivative such as 4-formylphenyl boronic acid, and the
composition may be formulated as described in e.g. WO 92/19709 and WO 92/19708.

The detergent may also contain other conventional detergent ingredients such as e.g.
fabric conditioners including clays, foam boosters, suds suppressors, anti-corrosion agents,
soil-suspending agents, anti-soil redeposition agents, dyes, bactericides, optical
brighteners, hydrotropes, tarnish inhibitors, or perfumes.

In the detergent compositions any enzyme, in particular the enzyme of the invention, may
be added in an amount corresponding to 0.01-100 mg of enzyme protein per litre of wash
liquor, preferably 0.05-5 mg of enzyme protein per litre of wash liquor, in particular 0.1-1
mg of enzyme protein per litre of wash liquor.

The enzyme of the invention may additionally be incorporated in the detergent formulations 
disclosed in WO 97/07202 which is hereby incorporated as reference. 


MATERIALS AND METHODS 
Textiles 

Standard textile pieces are obtained from EMPA St. Gallen, Lerchfeldstrasse 5, CH-9014
St. Gallen, Switzerland, especially type EMPA 116 (cotton textile stained with blood, milk
and ink) and EMPA 117 (polyester/cotton textile stained with blood, milk and ink).

Method for producing a subtilase variant 

The present invention provides a method of producing an isolated enzyme according to the
invention, wherein a suitable host cell, which has been transformed with a DNA sequence
encoding the enzyme, is cultured under conditions permitting the production of the enzyme,
and the resulting enzyme is recovered from the culture.

When an expression vector comprising a DNA sequence encoding the enzyme is
transformed into a heterologous host cell it is possible to enable heterologous recombinant
production of the enzyme of the invention. Thereby it is possible to make a highly purified
subtilase composition, characterized in being free from homologous impurities.

The medium used to culture the transformed host cells may be any conventional medium
suitable for growing the host cells in question. The expressed subtilase may conveniently
be secreted into the culture medium and may be recovered therefrom by well-known
procedures including separating the cells from the medium by centrifugation or filtration,
precipitating proteinaceous components of the medium by means of a salt such as ammonium
sulfate, followed by chromatographic procedures such as ion exchange chromatography,
affinity chromatography, or the like.


EXAMPLE 1 

Removal of ion-binding sites from BPN' like subtilases 

The below mentioned regions in JP170 and TY145 have been selected for transfer from
JP170 and TY145 to Savinase. By use of the molecular methods of preparing subtilase
variants as described herein, the Savinase regions (BPN' numbering) are deleted and the
JP170 and TY145 regions are inserted instead. Since the Savinase regions are in contact
with ion-binding sites, the purpose of the modifications is to remove the ion-binding site
from Savinase.

Savinase region A194-L196
JP170 region P209-P217 and
Savinase region L75-L82
TY145 region H83-Y92.

Alternatively the modification can be

Savinase region A194-L196
JP170 region P209-P217 and
Savinase region L75-L82
JP170 region N79-K83.

EXAMPLE 2 

Purification and assessment of enzyme concentration

After fermentation, purification of subtilisin variants is accomplished using Hydrophobic
Charge Induction Chromatography (HCIC) and subsequent vacuum filtration.

To capture the enzyme, the HCIC uses a cellulose matrix to which 4-Mercapto-Ethyl- 
Pyridine (4-MEP) is bound. 

Beads of the cellulose matrix sized 80-100 µm are mixed with a medium containing yeast
and the transformed B. subtilis capable of secreting the subtilisin variants and incubated at
pH 9.5 in Unifilter® microplates.

As 4-MEP is hydrophobic at pH > 7 and the subtilisin variants are hydrophobic at pH 9.5, a
hydrophobic association is made between the secreted enzyme and the 4-MEP on the
beads. After incubation the media and cell debris are removed by vacuum filtration while the
beads and enzyme are kept on the filter.

To elute the enzyme from the beads, the pH is lowered by washing the filter with an
elution buffer (pH 5). The enzymes thereby detach from the beads and can be retrieved from
the buffer.

The concentration of the purified subtilisin enzyme variants is assessed by active site
titration (AST).

The purified enzyme is incubated with the high affinity inhibitor CI-2A at different
concentrations to inhibit a varying amount of the active sites. The protease and inhibitor bind
to each other at a 1:1 ratio and accordingly the enzyme concentration can be directly related
to the concentration of inhibitor at which all protease is inactive. To measure the residual
protease activity, a substrate (0.6 mM Suc-Ala-Ala-Pro-Phe-pNA in Tris/HCl buffer) is
added after the incubation with inhibitor, and during the following 4 minutes the
development of the degradation product pNA (para-nitroaniline) is measured periodically at
405 nm on an ELISA reader.
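
The evaluation of such a titration can be sketched numerically. The following is a minimal illustration only, assuming that the residual activity falls linearly with inhibitor concentration until all active sites are blocked, so that the x-intercept of a least-squares line gives the concentration of active enzyme. The function name, the data values and the use of a plain linear fit are assumptions made for this illustration and are not taken from the method described above.

```python
def active_site_concentration(inhibitor_conc, residual_activity):
    """Estimate the active enzyme concentration from a titration series.

    Fits residual_activity = slope * inhibitor_conc + intercept by ordinary
    least squares and returns the x-intercept, i.e. the inhibitor
    concentration at which the extrapolated activity reaches zero. With 1:1
    binding this corresponds to the concentration of active enzyme.
    Only points on the linear (not yet fully inhibited) part of the curve
    should be passed in.
    """
    n = len(inhibitor_conc)
    if n < 2 or n != len(residual_activity):
        raise ValueError("need at least two paired data points")
    mean_x = sum(inhibitor_conc) / n
    mean_y = sum(residual_activity) / n
    sxx = sum((x - mean_x) ** 2 for x in inhibitor_conc)
    if sxx == 0:
        raise ValueError("inhibitor concentrations must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(inhibitor_conc, residual_activity))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    if slope >= 0:
        raise ValueError("activity does not decrease with inhibitor")
    return -intercept / slope


# Hypothetical data: inhibitor in nM, activity as rate of A405 increase.
inhibitor_nM = [0.0, 10.0, 20.0, 30.0]
activity = [0.40, 0.30, 0.20, 0.10]
enzyme_nM = active_site_concentration(inhibitor_nM, activity)
```

With the hypothetical data shown, the fitted line reaches zero activity at about 40 nM inhibitor, which under the 1:1 binding assumption would be read as about 40 nM active enzyme.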

EXAMPLE 3 

Wash performance of detergent compositions comprising modified enzymes

Wash performance of detergent compositions comprising enzyme hybrids or enzyme
variants of the present invention is tested at low washing temperature.

AMSA 

The enzyme variants of the present application are tested using the Automatic Mechanical
Stress Assay (AMSA). With the AMSA test the wash performance of a large quantity of
small volume enzyme-detergent solutions can be examined. The AMSA plate has a
number of slots for test solutions and a lid firmly squeezing the textile swatch to be washed
against all the slot openings. During the washing time, the plate, test solutions, textile and
lid are vigorously shaken to bring the test solution in contact with the textile and apply
mechanical stress. For further description see WO 02/42740, especially the paragraph
"Special method embodiments" at pages 23-24.



The assay is conducted under the experimental conditions specified below:

Detergent base                          Standard European detergent
Detergent dosage                        1.5 g/l
Test solution volume                    160 µl
pH                                      10-10.5 adjusted with NaHCO3
Wash time                               12 minutes
Temperature                             20°C
Water hardness                          9°dH
Enzyme concentration in test solution   5 nM, 10 nM and 30 nM
Test material                           EMPA 117

After washing the textile pieces are flushed in tap water and air-dried.

The performance of the enzyme variant is measured as the brightness of the colour of the
textile samples washed with that specific enzyme variant. Brightness can also be
expressed as the intensity of the light reflected from the textile sample when illuminated with
white light. When the textile is stained the intensity of the reflected light is lower than that
of a clean textile. Therefore the intensity of the reflected light can be used to measure wash
performance of an enzyme variant.

Colour measurements are made with a professional flatbed scanner (PFU DL2400pro),
which is used to capture an image of the washed textile samples. The scans are made with
a resolution of 200 dpi and with an output colour depth of 24 bits. In order to get accurate
results, the scanner is frequently calibrated with a Kodak reflective IT8 target.

To extract a value for the light intensity from the scanned images, a specially designed
software application is used (Novozymes Color Vector Analyzer). The program retrieves
the 24 bit pixel values from the image and converts them into values for red, green and
blue (RGB). The intensity value (Int) is calculated by adding the RGB values together as
vectors and then taking the length of the resulting vector:

Int = \sqrt{r^2 + g^2 + b^2}
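
The intensity calculation described above (the length of the RGB vector) can be illustrated with a short sketch. The function names and the averaging over all pixels of a swatch are assumptions made for this illustration; the text does not state how pixel values are pooled within a sample.

```python
import math


def intensity(r, g, b):
    """Length of the (r, g, b) vector for one pixel (8 bits per channel)."""
    return math.sqrt(r * r + g * g + b * b)


def mean_intensity(pixels):
    """Average intensity over an iterable of (r, g, b) tuples.

    Averaging over the swatch area is an assumption of this sketch.
    """
    pixels = list(pixels)
    if not pixels:
        raise ValueError("no pixels given")
    return sum(intensity(r, g, b) for r, g, b in pixels) / len(pixels)


# Hypothetical pixel values for a small region of a scanned swatch.
swatch = [(200, 190, 180), (205, 195, 185), (198, 188, 178)]
swatch_int = mean_intensity(swatch)
```

A pure white pixel (255, 255, 255) gives the largest possible value, 255 times the square root of 3, so stained areas always score lower than clean ones.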


The wash performance (P) of the variants is calculated in accordance with the below
formula:

P = Int(v) - Int(r)

where

Int(v) is the light intensity value of textile surface washed with enzyme variant and

Int(r) is the light intensity value of textile surface washed with the reference enzyme, e.g.
subtilisin 309 (BLSAVI).

Performance Scores (S) sum up the performances (P) of the tested enzyme
variants as:

S (2), which indicates that the variant performs better than the reference at all three
concentrations (5, 10 and 30 nM), and

S (1), which indicates that the variant performs better than the reference at one or two
concentrations.
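
The scoring scheme can be written out as a small sketch. It assumes that P is the difference between the intensity of the variant and that of the reference, and that "performs better" means P > 0. The score 0 for a variant that is better at none of the concentrations is not defined in the text and is introduced here only to make the function total; the function names and data values are likewise hypothetical.

```python
def performance(int_variant, int_reference):
    """Wash performance P, taken here as Int(v) minus Int(r)."""
    return int_variant - int_reference


def performance_score(variant_int, reference_int):
    """Performance score S from intensities keyed by enzyme concentration (nM).

    Returns 2 if the variant is better at every tested concentration,
    1 if it is better at some but not all of them, and 0 otherwise
    (the 0 case is an assumption of this sketch).
    """
    concentrations = sorted(reference_int)
    better = sum(
        1 for c in concentrations
        if performance(variant_int[c], reference_int[c]) > 0
    )
    if better == len(concentrations):
        return 2
    if better >= 1:
        return 1
    return 0


# Hypothetical intensity values at 5, 10 and 30 nM enzyme.
reference = {5: 400.0, 10: 415.0, 30: 425.0}
variant_a = {5: 410.0, 10: 420.0, 30: 430.0}
variant_b = {5: 395.0, 10: 420.0, 30: 430.0}
score_a = performance_score(variant_a, reference)
score_b = performance_score(variant_b, reference)
```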



Mini wash assay

A millilitre scale wash performance assay is conducted under the following conditions:

Detergent base    Standard European detergent powder
Detergent dose    1.5 g/l
pH                "as is" in the current detergent solution and is not adjusted.
Wash time         14 min.
Temperature       20°C
Water hardness    9°dH, adjusted by adding CaCl2·2H2O, MgCl2·6H2O and NaHCO3
                  (Ca2+:Mg2+:HCO3- = 2:1:6) to milli-Q water.
Enzyme conc.      5 nM, 10 nM
Test system       125 ml glass beakers. Textile dipped in test solution.
                  Continuously up and down, 50 times per minute.
Textile/volume    1 textile piece (13 x 3 cm) in 50 ml test solution
Test material     EMPA 117 textile swatches
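
As a worked illustration of the water hardness entry, the ion concentrations implied by 9°dH at a Ca2+:Mg2+:HCO3- ratio of 2:1:6 can be estimated as follows. The sketch assumes the usual definition of one German degree of hardness as 10 mg CaO per litre (about 0.178 mmol/l of divalent cations) and reads the ratio as a molar ratio; both points, and the function name, are assumptions of this illustration.

```python
# 1 °dH = 10 mg CaO per litre; molar mass of CaO taken as 56.077 g/mol.
MMOL_PER_DH = 10.0 / 56.077


def hardness_recipe(degrees_dh, ratio=(2, 1, 6)):
    """Return mmol/l of Ca2+, Mg2+ and HCO3- for a given hardness in °dH.

    ratio is the assumed molar ratio Ca2+ : Mg2+ : HCO3-.
    Only Ca2+ and Mg2+ count towards the hardness value.
    """
    ca_part, mg_part, hco3_part = ratio
    total_divalent = degrees_dh * MMOL_PER_DH
    unit = total_divalent / (ca_part + mg_part)
    return {
        "Ca2+": ca_part * unit,
        "Mg2+": mg_part * unit,
        "HCO3-": hco3_part * unit,
    }


recipe = hardness_recipe(9.0)
```

Under these assumptions 9°dH corresponds to roughly 1.07 mM Ca2+, 0.53 mM Mg2+ and 3.21 mM HCO3-.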



After wash the measurement of remission from the test material is done at 460 nm using a
Zeiss MCS 521 VIS spectrophotometer. The measurements are done according to the
manufacturer's protocol.



CLAIMS



Received PVS, 21 March 2003



1. A JP170 like subtilase which is at least 58% homologous to the sequence of SEQ ID
NO:1, comprising the overall subtilisin fold and the following structural characteristics:

a) a twisted beta-sheet with 7 strands,

b) six alpha helices,

c) three ion-binding sites,

and not comprising the Strong and Weak ion-binding sites of the BPN' like subtilases, and
with the exception of the subtilases JP170, KP1790, KP9860, KP43, Y, SD-521 and
variants aam50090, aam50086, aam50085, aam50084, aam50083, aam50082, aam50081,
aam50080 of EP 1209233.

2. The subtilase of claim 1, wherein the positions of said three ion-binding sites in the
three-dimensional structure of the subtilase are defined by the distance to the c-alpha atoms
of the three active site amino acid residues of the subtilases, that is Ser, His and Asp, and
the c-alpha atom of the amino acid residue next to the active site Ser residue (next to Ser),
wherein said distances between:

a) ion-binding site 1 and i) Asp c-alpha atom is 26.70-28.70 Å, ii) His c-alpha atom is
22.10-24.10 Å, iii) Ser c-alpha atom is 16.95-18.95 Å, iv) next to Ser c-alpha atom is
15.30-17.30 Å,

b) ion-binding site 2 and i) Asp c-alpha atom is 33.50-35.50 Å, ii) His c-alpha atom is
37-39 Å, iii) Ser c-alpha atom is 29.40-31.40 Å, iv) next to Ser c-alpha atom is
30.70-32.70 Å,

c) ion-binding site 3 and i) Asp c-alpha atom is 41.50-43.50 Å, ii) His c-alpha atom is
42.90-44.90 Å, iii) Ser c-alpha atom is 34.50-36.50 Å, iv) next to Ser c-alpha atom is
35-37 Å.



3. A subtilase according to claim 2 wherein the positions of the three ion-binding sites are
defined by the distance to the c-alpha atoms of amino acid residues Asp30, His68, Ser254
and Met255 of SEQ ID NO:1 or by the distances to the c-alpha atoms of equivalent amino
acid residues in another subtilase of the invention in accordance with claim 1, wherein the
distance between

a) ion-binding site 1 and i) Asp c-alpha atom is 27.69 Å, ii) His c-alpha atom is 23.12 Å, iii)
Ser c-alpha atom is 17.95 Å, iv) next to Ser c-alpha atom is 16.34 Å,

b) ion-binding site 2 and i) Asp c-alpha atom is 34.49 Å, ii) His c-alpha atom is 38.03 Å, iii)
Ser c-alpha atom is 30.41 Å, iv) next to Ser c-alpha atom is 31.68 Å,

c) ion-binding site 3 and i) Asp c-alpha atom is 42.48 Å, ii) His c-alpha atom is 43.87 Å, iii)
Ser c-alpha atom is 35.51 Å, iv) next to Ser c-alpha atom is 36.02 Å,

and wherein the variation on the above mentioned distances is ±0.80 Å, preferably
±0.70 Å, more preferably ±0.60 Å, more preferably ±0.50 Å, more preferably ±0.40 Å, or most
preferably ±0.30 Å.
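
The distance criterion of claims 2 and 3 can be illustrated with a small geometric check. This is only a sketch under stated assumptions: the coordinates would come from a solved or modelled structure, the helper names are hypothetical, and the reference distances in the table are the values of claim 3 as read from the text.

```python
import math

# Reference distances in Å (claim 3), per ion-binding site, in the order
# Asp, His, Ser, next-to-Ser c-alpha atom.
REFERENCE_DISTANCES = {
    1: (27.69, 23.12, 17.95, 16.34),
    2: (34.49, 38.03, 30.41, 31.68),
    3: (42.48, 43.87, 35.51, 36.02),
}


def distance(a, b):
    """Euclidean distance between two (x, y, z) points (Python 3.8+)."""
    return math.dist(a, b)


def sites_match(ion_sites, ca_atoms, reference=None, tolerance=0.80):
    """Check ion-site to c-alpha distances against reference values.

    ion_sites: dict mapping site number to (x, y, z) coordinates.
    ca_atoms:  sequence of (x, y, z) coordinates in the same order as the
               reference tuples (Asp, His, Ser, next to Ser).
    Returns True only if every distance is within the tolerance in Å.
    """
    if reference is None:
        reference = REFERENCE_DISTANCES
    for site, expected_distances in reference.items():
        for atom, expected in zip(ca_atoms, expected_distances):
            if abs(distance(ion_sites[site], atom) - expected) > tolerance:
                return False
    return True
```

A stricter tolerance, e.g. 0.30, corresponds to the narrowest variation given in claim 3.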

4. A method of producing a variant of a parent JP170 like subtilase, the variant having at
least one altered property as compared to the parent JP170 like subtilase, the method
comprising:

a) modelling the parent JP170 like subtilase on the three-dimensional structure of a
JP170 subtilase to produce a three-dimensional structure of the parent JP170 like
subtilase;

b) identifying on the basis of the comparison in step a) at least one structural part of the
parent JP170 subtilase, wherein an alteration in said structural part is predicted to
result in an altered property;

c) modifying the nucleic acid sequence encoding the parent JP170 subtilase to produce a
nucleic acid sequence encoding deletion or substitution of one or more amino acids at
a position corresponding to said structural part, or an insertion of one or more amino
acid residues in positions corresponding to said structural part;

d) expressing the modified nucleic acid sequence in a host cell to produce the variant
JP170 subtilase;

e) isolating the produced subtilase;

f) purifying the isolated subtilase and

g) recovering the purified subtilase.

5. A method according to claim 4, wherein the JP170 subtilase on which the parent JP170
subtilase is modelled in step a) is at least 58% homologous to SEQ ID NO:1, preferably at
least 60% homologous, more preferably at least 65%, more preferably at least 70%, more
preferably at least 75%, more preferably at least 80%, more preferably at least 85%, more
preferably at least 90%, more preferably at least 91%, more preferably at least 92%, more
preferably at least 93%, more preferably at least 94%, more preferably at least 95%, more
preferably at least 96%, more preferably at least 97%, more preferably at least 98% or
even more preferably at least 99% homologous to the sequence of SEQ ID NO:1.

6. A method according to claim 4 or 5, wherein the JP170 subtilase on which the parent
JP170 subtilase is modelled in step a) is defined in accordance with claim 3.

7. A method of producing a variant of a parent Subtilisin family subtilase, the variant having
at least one altered property as compared to the parent Subtilisin family subtilase, the
method comprising:

a) modelling the parent Subtilisin family subtilase on the three-dimensional structure of a
Subtilisin family subtilase to produce a three-dimensional structure of the parent
Subtilisin family subtilase;

b) comparing the three-dimensional structure obtained in step a) to the three-dimensional
structure of a JP170 like subtilase;

c) identifying on the basis of the comparison in step b) at least one structural part of the
parent Subtilisin family subtilase, wherein an alteration in said structural part is
predicted to result in an altered property;

d) modifying the nucleic acid sequence encoding the parent Subtilisin family subtilase to
produce a nucleic acid sequence encoding deletion or substitution of one or more
amino acids at a position corresponding to said structural part, or an insertion of one or
more amino acid residues in positions corresponding to said structural part;

e) expressing the modified nucleic acid sequence in a host cell to produce the variant
Subtilisin family subtilase;

f) isolating the produced subtilase;

g) purifying the isolated subtilase and

h) recovering the purified subtilase.

8. A method according to claim 7, wherein the Subtilisin family subtilase on which the
parent Subtilisin family subtilase is modelled in step a) is at least 61% homologous to SEQ ID
NO:4, preferably at least 63% homologous, preferably at least 65% homologous, more
preferably at least 70%, more preferably at least 74%, more preferably at least 80%, more
preferably at least 83%, more preferably at least 90%, more preferably at least 91%, more
preferably at least 92%, more preferably at least 93%, more preferably at least 94%, more
preferably at least 95%, more preferably at least 96%, more preferably at least 97%, more
preferably at least 98% or even more preferably at least 99% homologous to the sequence
of SEQ ID NO:4.

9. A method according to any of claims 7 and 8, wherein the JP170 subtilase of step b) is
defined in accordance with claim 3.

10. A method according to any of claims 7-9, wherein the JP170 subtilase in step b) is at
least 58% homologous with the sequence of SEQ ID NO:1, preferably at least 60%
homologous, more preferably at least 65%, more preferably at least 70%, more preferably at
least 75%, more preferably at least 80%, more preferably at least 85%, more preferably at
least 90%, more preferably at least 91%, more preferably at least 92%, more preferably at
least 93%, more preferably at least 94%, more preferably at least 95%, more preferably at
least 96%, more preferably at least 97%, more preferably at least 98% or even more
preferably at least 99% homologous to the sequence of SEQ ID NO:1.

11. A method of producing a variant of a parent JP170 like subtilase, the variant having at
least one altered property as compared to the parent JP170 like subtilase, the method
comprising:

a) modelling the parent JP170 like subtilase on the three-dimensional structure of a
JP170 like subtilase to produce a three-dimensional structure of the parent JP170 like
subtilase;

b) comparing the three-dimensional structure obtained in step a) to the three-dimensional
structure of a Subtilisin family subtilase;

c) identifying on the basis of the comparison in step b) at least one structural part of the
parent JP170 like subtilase, wherein an alteration in said structural part is predicted to
result in an altered property;

d) modifying the nucleic acid sequence encoding the parent JP170 like subtilase to
produce a nucleic acid sequence encoding deletion or substitution of one or more amino
acids at a position corresponding to said structural part, or an insertion of one or more
amino acid residues in positions corresponding to said structural part;

e) expressing the modified nucleic acid sequence in a host cell to produce the variant
JP170 like subtilase;

f) isolating the produced subtilase;

g) purifying the isolated subtilase; and

h) recovering the purified subtilase.

12. A method according to claim 11, wherein the Subtilisin family subtilase of step b) is at
least 61% homologous to SEQ ID NO:4, preferably at least 63% homologous, preferably at
least 65% homologous, more preferably at least 70%, more preferably at least 74%, more
preferably at least 80%, more preferably at least 83%, more preferably at least 90%, more
preferably at least 91%, more preferably at least 92%, more preferably at least 93%, more
preferably at least 94%, more preferably at least 95%, more preferably at least 96%, more
preferably at least 97%, more preferably at least 98% or even more preferably at least 99%
homologous to the sequence of SEQ ID NO:4.

13. A method according to any of claims 11 and 12, wherein the parent JP170 like subtilase
is defined in accordance with claim 3.

14. A method according to any of claims 11-13, wherein the parent JP170 like subtilase is
at least 58% homologous with the sequence of SEQ ID NO:1, preferably at least 60%
homologous, more preferably at least 65%, more preferably at least 70%, more preferably at
least 75%, more preferably at least 80%, more preferably at least 85%, more preferably at
least 90%, more preferably at least 91%, more preferably at least 92%, more preferably at
least 93%, more preferably at least 94%, more preferably at least 95%, more preferably at
least 96%, more preferably at least 97%, more preferably at least 98% or even more
preferably at least 99% homologous to the sequence of SEQ ID NO:1.

15. A variant subtilase comprising an alteration in one or more positions located at a
distance of not more than 10 Å to one of the ion-binding sites of JP170, wherein the positions,
as specified in SEQ ID NO:1, located at a distance of not more than 10 Å to:

a) ion-binding site 1 are: 183, 184, 185, 186, 187, 188, 189, 191, 192, 193, 194, 195, 196,
197, 198, 199, 200, 201, 202, 203, 204, 224 and 225,

b) ion-binding site 2 are: 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390,
391, 392 and 393,

c) ion-binding site 3 are: 348, 350, 352, 363, 364, 365, 366, 367, 368, 369, 370, 380, 381,
382, 383, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 414, 415, 416, 417, 418,
419 and 420.


16. A subtilase variant according to claim 15 comprising one or more of the substitutions:
S193Q,Y; H200D,N; H200D,N+D196N; N390D; N391D; G394N,Q,F,Y,S and W392S,N,Q.



17. A JP170 like subtilase variant comprising the introduction of an ion-binding site
corresponding to the Strong Ion-binding site of the BPN' like family subtilases, wherein said
variant has a deletion of or in the region N79-N82 of SEQ ID NO:1 and subsequent insertion of
one or more amino acid residues, preferably insertion of the sequence LNNSIQV followed
by the substitution A45D,N and optionally the substitutions E44P,T and/or R47Q.

18. A JP170 like subtilase variant in which one or more ion-binding sites have been
removed, wherein said variant comprises deletion of or in the region N186-N199 of SEQ ID
NO:1 and subsequent insertion of one or more amino acid residues, preferably insertion of
the sequence SSN, and preferably further comprising one or both of the substitutions I7Q
and V3Y.


19. A BPN' like subtilase variant in which the ion-binding sites have been removed, wherein
said variant comprises:

a) deletion of or in the region A194-L196 (Savinase in BPN' numbering) or a
corresponding region in another BPN' like subtilase and insertion of three or more amino
acid residues, preferably insertion of P209-P217 from JP170 or a corresponding
region in another JP170 like subtilase, and

deletion of or in the region L75-L82 (Savinase in BPN' numbering) or a corresponding
region in said other BPN' like subtilase and insertion of one or more amino acid
residues, preferably insertion of H83-Y92 from TY145 or a corresponding region in
another TY145 like subtilase; or

b) deletion of or in the region A194-L196 (Savinase in BPN' numbering) or a
corresponding region in another BPN' like subtilase and insertion of three or more amino
acid residues, preferably insertion of P209-P217 from JP170 or a corresponding
region in another JP170 like subtilase, and

deletion of or in the region L75-L82 (Savinase in BPN' numbering) or a corresponding
region in said other BPN' like subtilase and insertion of one or more amino acid residues,
preferably insertion of N79-K83 from JP170 or a corresponding region in another
JP170 like subtilase.

20. A JP170 like subtilase variant comprising an alteration in one or more of the following
positions:

13, 14, 15, 16, 17, 18,
37, 38, 39, 40, 41, 42, 43,
47, 48, 49, 50,
57, 58, 59,
96, 97, 98, 99, 100, 101, 102, 103,
131, 132, 133, 134,
152, 153,
162, 163, 164, 165, 166,
188, 189, 190, 191, 192, 193, 194, 195,
210,
234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246,
372, 373, 374, 375, 376, 377, 378,
387, 388, 389, 390, 391, 392,
406, 407 and
419.

21. A JP170 like subtilase variant comprising an alteration in one or more of the following
positions:

37, 38, 39, 40, 41, 42,
57, 58, 59, 60,
66, 67,
98, 99, 100, 101, 102, 103,
107, 108, 109, 110, 111,
188, 189, 190, 191, 192, 193,
236, 237, 238, 239, 240,
326, 327, 328, 329, 330, 331, 332,
337, 338, 339, 340, 341, 342,
355, 356, 357, 358, 359, 360,
372, 373, 374, 375, 376, 377,
384, 385, 386, 387, 388,
404, 405, 406, 407, 408, 409, 410, 411.

22. A subtilase variant according to claim 21 comprising one or more of the modifications:
W240H,Y; G355A,S; S356T,N; T357N,Q,D,E,P; T358S; A359S,T,N,Q and S360T,N.

23. A variant subtilase comprising an alteration in one or more positions which are within a
distance of 10 Å from a CI2 inhibitor which is bound to the active site of JP170, wherein the
positions, as specified in SEQ ID NO:1, are:

29, 30, 31, 32, 64, 65, 66, 67, 68, 69, 70, 71, 72, 93, 96, 97, 98, 100, 101, 102, 103, 104,
105, 106, 107, 108, 109, 110, 113, 114, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136,
138, 139, 140, 141, 144, 157, 174, 180, 181, 182, 183, 191, 193, 194, 202, 203, 204, 205,
206, 207, 211, 223, 224, 225, 226, 234, 235, 236, 237, 238, 239, 240, 241, 249, 250, 251,
252, 253, 254, 255, 256, 257, 258,

preferably comprising the substitution W129L.

24. A JP170 like subtilase variant comprising one or more disulfide bridges introduced by
one or more of the following modifications: G21C/A86C, V26C/A265C, G57C/G105C,
G74C/A229C, Q111C/Y143C, G160C/S170C, A286C/V349C, A27C/A122C, A45C/G78C,
V72C/P258C, G78C/A229C, D98C/G104C, Q111C/Y147C, G135C/G167C, R142C/P354C,
V144C/A178C, G182C/P217C, A183C/G223C, A195C/Y225C, F271C/P279C,
A287C/A430C, A293C/S310C, E322C/S428C, S324C/A332C, S327C/P424C,
D352C/N397C, G255C/T362C, G291C/S314C, A4C/P222C and A27C/V117C, wherein the
positions correspond to the positions in SEQ ID NO:1.

25. A JP170 like subtilase variant comprising an alteration in one or more of the positions
N79, N316, L381, K246, K9, K313 and K83, preferably comprising one or more of the
substitutions N79D, N316D, L381D, K246R, K9R, K313R and K83R of SEQ ID NO:1.

26. A JP170 like subtilase variant comprising an alteration in one or more of the positions
22, 44, 110, 139, 140, 166, 198, 201, 203, 231, 282, 356, 357 and 378, preferably
comprising one or more of the substitutions: Q22P, E44P, L110P, T139P, D140P, S166P, I198P,
V201P, Q203P, S231P, S282P, S356P, T357P and K378P.

27. A JP170 like subtilase variant comprising a deletion of the region 311-433, preferably
deletion of positions 317-433 or 315-433, further comprising one or more of the
substitutions L283N,Q; A290S,N and W306H,Y,K.

28. A subtilase variant according to claim 27, comprising

a) deletion of positions 317-433 and the substitutions L283N, A290S and W306H, or

b) deletion of region 315-433 and the substitutions L283N, A290S and W306H.

29. An isolated nucleic acid sequence comprising a nucleic acid sequence which encodes
the subtilase variant defined or produced in any of the preceding claims.

30. An isolated nucleic acid sequence according to claim 29, wherein the nucleic acid
sequence is selected from the group consisting of:

a) a nucleic acid sequence encoding an enzyme having at least 58% homology with the
amino acid sequence shown in SEQ ID NO:1, and

b) a nucleic acid sequence which hybridizes under low stringency conditions, preferably
under medium stringency conditions, in particular under high stringency conditions,
with a complementary strand of the nucleic acid sequence encoding an enzyme having
at least 58% homology with the amino acid sequence shown in SEQ ID NO:1, or

c) a subsequence of any of a) or b) of at least 100 nucleotides.

31. An isolated nucleic acid construct comprising a nucleic acid sequence as defined in any 
of claims 29-30, operably linked to one or more control sequences capable of directing the 
expression of the polypeptide in a suitable expression host. 


32. A recombinant host cell comprising the nucleic acid construct of claim 31.

33. A method for producing the variant defined in any of the preceding claims, the method
comprising:

a) cultivating the recombinant host cell of claim 32 under conditions conducive to the
production of the subtilase variant; and

b) recovering the variant.

34. A detergent composition comprising a JP170 like subtilase variant or a BPN' like
subtilase variant.

35. Use of a JP170 like subtilase variant or a BPN' like subtilase variant in cleaning or
washing applications.
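
The substitution shorthand used in the claims above (original residue, position in SEQ ID NO:1, then one or more alternative replacement residues, with "+" joining combined substitutions, written here with commas between alternatives) can be read mechanically. The following is only a minimal Python sketch of that reading; the function names are hypothetical and are not part of the application, and the ten-residue parent is just the start of SEQ ID NO:1 used as a toy input.

```python
import re

# One substitution token: original residue, 1-based position,
# then one or more alternative replacement residues separated by commas.
SUB_RE = re.compile(r"^([A-Z])(\d+)([A-Z](?:,[A-Z])*)$")


def parse_substitution(token):
    """Parse e.g. 'S193Q,Y' into ('S', 193, ['Q', 'Y'])."""
    match = SUB_RE.match(token.strip())
    if match is None:
        raise ValueError(f"not a substitution token: {token!r}")
    original, position, alternatives = match.groups()
    return original, int(position), alternatives.split(",")


def parse_combination(text):
    """Split a '+'-joined combination such as 'H200D,N+D196N'."""
    return [parse_substitution(part) for part in text.split("+")]


def apply_substitution(sequence, original, position, new):
    """Return a copy of `sequence` with one residue replaced (1-based position)."""
    if sequence[position - 1] != original:
        raise ValueError(
            f"expected {original} at position {position}, "
            f"found {sequence[position - 1]}"
        )
    return sequence[: position - 1] + new + sequence[position:]


# Toy parent: only the first ten residues of SEQ ID NO:1 (an illustration,
# not a full-length enzyme).
parent = "NDVARGIVKA"
variant = apply_substitution(parent, "I", 7, "Q")   # I7Q, as named in claim 18
variant = apply_substitution(variant, "V", 3, "Y")  # V3Y, as named in claim 18
print(variant)
```

The check inside `apply_substitution` is a deliberate design choice in this sketch: it refuses a substitution whose stated original residue does not match the parent sequence at that position, which is a cheap way to catch numbering mistakes.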

Received PVS 21 MRS. 2003

ABSTRACT 

The present invention relates to methods for producing variants of a parent JP170 subtilase
and of a parent BPN' subtilase and to JP170 and BPN' variants having altered properties
as compared to the parent JP170/BPN' subtilase.


Figure 1, Alignment of 3D sequences of protease JP170 (mature sequence from App. 1),
SD-521 (aam50084 from EP 1209233) and protease Y (aay44619 from WO99/67370).

            1                                                   50
aam50084    NDVARGIVKA DVAQNNYGLY GQGQVVAVAD TGLDTGRNDS SMHEAFRGKI
aay44619    NDVARGIVKA DVAQNNYGLY GQGQLVAVAD TGLDTGRNDS SMHEAFRGKI
JP170       NDVARGIVKA DVAQNNFGLY GQGQIVAVAD TGLDTGRNDS SMHEAFRGKI

            51                                                 100
aam50084    TALYALGRTN NANDPNGHGT HVAGSVLGNA LNKGMAPQAN LVFQSIMDSS
aay44619    TALYALGRTN NASDPNGHGT HVAGSVLGNA LNKGMAPQAN LVFQSIMDSS
JP170       TALYALGRTN NANDPNGHGT HVAGSVLGNA TNKGMAPQAN LVFQSIMDSG

            101                                                150
aam50084    GGLGGLPSNL NTLFSQAWNA GARIHTNSWG APVNGAYTAN SRQVDEYVRN
aay44619    GGLGGLPSNL NTLFSQAWNA GARIHTNSWG APVNGAYTAN SRQVDEYVRN
JP170       GGLGGLPANL QTLFSQAYSA GARIHTNSWG APVNGAYTTD SRNVDDYVRK

            151                                                200
aam50084    NDMTVLFAAG NEGPNSGTIS APGTAKNAIT VGATENYRPS FGSLADNPNH
aay44619    NDMTVLFAAG NEGPNSGTIS APGTAKNAIT VGATENYRPS FGSIADNPNH
JP170       NDMTILFAAG NEGPGSGTIS APGTAKNAIT VGATENLRPS FGSYADNINH

            201                                                250
aam50084    IAQFSSRGAT RDGRIKPDVT APGTFILSAR SSLAPDSSFW ANYNSKYAYM
aay44619    IAQFSSRGAT RDGRIKPDVT APGTFILSAR SSLAPDSSFW ANYNSKYAYM
JP170       VAQFSSRGPT RDGRIKPDVM APGTYILSAR SSLAPDSSFW ANHDSKYAYM

            251                                                300
aam50084    GGTSMATPIV AGNVAQLREH FIKNRGITPK PSLIKAALIA GATDVGLGYP
aay44619    GGTSMATPIV AGNVAQLREH FIKNRGITPK PSLIKAALIA GATDVGLGYP
JP170       GGTSMATPIV AGNVAQLREH FVKNRGVTPK PSLLKAALIA GAADVGLGFP

            301                                                350
aam50084    SGDQGWGRVT LDKSLNVAYV NEATALATGQ KATYSFQAQA GKPLKISLVW
aay44619    SGDQGWGRVT LDKSLNVAYV NEATALATGQ KATYSFQAQA GKPLKISLVW
JP170       NGNQGWGRVT LDKSLNVAFV NETSPLSTSQ KATYSFTAQA GKPLKISLVW

            351                                                400
aam50084    TDAPGSTTAS YTLVNDLDLV ITAPNGQKYV GNDFSYPYDN NWDGRNNVEN
aay44619    TDAPGSTTAS YTLVNDLDLV ITAPNGQKYV GNDFSYPYDN NWDGRNNVEN
JP170       SDAPGSTTAS LTLVNDLDLV ITAPNGTKYV GNDFTAPYDN NWDGRNNVEN

            401                              433
aam50084    VFINAPQSGT YTIEVQAYNV PSGPQRFSLA IVH
aay44619    VFINAPQSGT YIIEVQAYNV PSGPQRFSLA IVH
JP170       VFINAPQSGT YTVEVQAYNV PVSPQTFSLA IVH
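
As a rough illustration of how an alignment such as Figure 1 relates to the percentage thresholds recited in the claims, the sketch below counts identical positions between two already-aligned rows. This is an assumption-laden simplification: plain percent identity over aligned columns is not necessarily the homology measure defined in the description, and the function name is hypothetical. The input is the first 50-residue block of the aay44619 and JP170 rows.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of identical residues over columns where neither row has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = [(a, b) for a, b in zip(aligned_a, aligned_b)
                if a != "-" and b != "-"]
    if not compared:
        raise ValueError("no aligned residue pairs to compare")
    matches = sum(1 for a, b in compared if a == b)
    return 100.0 * matches / len(compared)


# Residues 1-50 of two rows of Figure 1; spaces are only visual grouping.
aay44619 = "NDVARGIVKA DVAQNNYGLY GQGQLVAVAD TGLDTGRNDS SMHEAFRGKI".replace(" ", "")
jp170 = "NDVARGIVKA DVAQNNFGLY GQGQIVAVAD TGLDTGRNDS SMHEAFRGKI".replace(" ", "")

# The two rows differ at positions 17 (Y/F) and 25 (L/I), so 48 of 50 match.
print(percent_identity(aay44619, jp170))  # 96.0
```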
Figure 3, Matrix of homology between subtilases pertaining to the JP170, TY145 and BPN'
subgroups. The sequences are identified by sequence database accession numbers:

00: aam50084; Subtilase derived from Bacillus sp. strain SD-521
0: aaw89547; Subtilase derived from Bacillus sp. JP170
1: q45681; Subtilase derived from B. subtilis (BSTA41)
2: p28842; Psychrophilic subtilisin derived from Antarctic Bacillus strain (BSTA39)
3: abb77095; Subtilase derived from Bacillus sp. (TY145)
4: p00783; Subtilase derived from Bacillus subtilis var. amylosacchariticus (BSAMY)
5: p29142; Subtilase derived from Bacillus stearothermophilus (BSSJ)
6: p35835; Subtilase derived from Bacillus subtilis var. natto (BSNAT)
7: p07518; Subtilase derived from Bacillus pumilus (B. mesentericus) (BPMES)
8: p00782; Subtilase derived from Bacillus amyloliquefaciens (BPN')
9: p00780; Subtilase derived from Bacillus licheniformis (BLSCAR)
10: p41363; Subtilase derived from Bacillus halodurans (BHSAH)
11: aaw62222; Subtilase derived from Bacillus lentus (BLS147)
12: p29600; Subtilase derived from Bacillus lentus (BLSAVI, BLS309)
13: p27693; Subtilase derived from Bacillus alcalophilus (BAALKP)
14: q99405; Subtilase derived from Bacillus sp. strain KSM-K16 (BSKSMK)
15: p29599; Subtilase derived from Bacillus lentus (BLSUBL).

10321 SEQ list.ST25
SEQUENCE LISTING

<110> Novozymes A/S
<120> Novel subtilases
<130> 10321.000-DK
<160> 4
<170> PatentIn version 3.2

<210> 1
<211> 433
<212> PRT
<213> Bacillus sp. JP170

<220>
<221> PEPTIDE
<222> (1)..(433)
<223> JP170 subtilase

<400> 1

Asn Asp Val Ala Arg Gly Ile Val Lys Ala Asp Val Ala Gln Asn Asn
1               5                   10                  15

Phe Gly Leu Tyr Gly Gln Gly Gln Ile Val Ala Val Ala Asp Thr Gly
            20                  25                  30

Leu Asp Thr Gly Arg Asn Asp Ser Ser Met His Glu Ala Phe Arg Gly
        35                  40                  45

Lys Ile Thr Ala Leu Tyr Ala Leu Gly Arg Thr Asn Asn Ala Asn Asp
    50                  55                  60

Pro Asn Gly His Gly Thr His Val Ala Gly Ser Val Leu Gly Asn Ala
65                  70                  75                  80

Thr Asn Lys Gly Met Ala Pro Gln Ala Asn Leu Val Phe Gln Ser Ile
                85                  90                  95

Met Asp Ser Gly Gly Gly Leu Gly Gly Leu Pro Ala Asn Leu Gln Thr
            100                 105                 110

Leu Phe Ser Gln Ala Tyr Ser Ala Gly Ala Arg Ile His Thr Asn Ser
        115                 120                 125

Trp Gly Ala Pro Val Asn Gly Ala Tyr Thr Thr Asp Ser Arg Asn Val
    130                 135                 140

Asp Asp Tyr Val Arg Lys Asn Asp Met Thr Ile Leu Phe Ala Ala Gly
145                 150                 155                 160

Asn Glu Gly Pro Gly Ser Gly Thr Ile Ser Ala Pro Gly Thr Ala Lys
                165                 170                 175

Asn Ala Ile Thr Val Gly Ala Thr Glu Asn Leu Arg Pro Ser Phe Gly
            180                 185                 190

Ser Tyr Ala Asp Asn Ile Asn His Val Ala Gln Phe Ser Ser Arg Gly
        195                 200                 205

Pro Thr Arg Asp Gly Arg Ile Lys Pro Asp Val Met Ala Pro Gly Thr
    210                 215                 220

Tyr Ile Leu Ser Ala Arg Ser Ser Leu Ala Pro Asp Ser Ser Phe Trp
225                 230                 235                 240

Ala Asn His Asp Ser Lys Tyr Ala Tyr Met Gly Gly Thr Ser Met Ala
                245                 250                 255

Thr Pro Ile Val Ala Gly Asn Val Ala Gln Leu Arg Glu His Phe Val
            260                 265                 270

Lys Asn Arg Gly Val Thr Pro Lys Pro Ser Leu Leu Lys Ala Ala Leu
        275                 280                 285

Ile Ala Gly Ala Ala Asp Val Gly Leu Gly Phe Pro Asn Gly Asn Gln
    290                 295                 300

Gly Trp Gly Arg Val Thr Leu Asp Lys Ser Leu Asn Val Ala Phe Val
305                 310                 315                 320

Asn Glu Thr Ser Pro Leu Ser Thr Ser Gln Lys Ala Thr Tyr Ser Phe
                325                 330                 335

Thr Ala Gln Ala Gly Lys Pro Leu Lys Ile Ser Leu Val Trp Ser Asp
            340                 345                 350

Ala Pro Gly Ser Thr Thr Ala Ser Leu Thr Leu Val Asn Asp Leu Asp
        355                 360                 365

Leu Val Ile Thr Ala Pro Asn Gly Thr Lys Tyr Val Gly Asn Asp Phe
    370                 375                 380

Thr Ala Pro Tyr Asp Asn Asn Trp Asp Gly Arg Asn Asn Val Glu Asn
385                 390                 395                 400

Val Phe Ile Asn Ala Pro Gln Ser Gly Thr Tyr Thr Val Glu Val Gln
                405                 410                 415

Ala Tyr Asn Val Pro Val Ser Pro Gln Thr Phe Ser Leu Ala Ile Val
            420                 425                 430

His
<210> 2
<211> 433
<212> PRT
<213> Bacillus sp. Y

<220>
<221> PEPTIDE
<222> (1)..(433)
<223> subtilase Y

<400> 2

Asn Asp Val Ala Arg Gly Ile Val Lys Ala Asp Val Ala Gln Asn Asn
1               5                   10                  15

Tyr Gly Leu Tyr Gly Gln Gly Gln Leu Val Ala Val Ala Asp Thr Gly
            20                  25                  30

Leu Asp Thr Gly Arg Asn Asp Ser Ser Met His Glu Ala Phe Arg Gly
        35                  40                  45

Lys Ile Thr Ala Leu Tyr Ala Leu Gly Arg Thr Asn Asn Ala Ser Asp
    50                  55                  60

Pro Asn Gly His Gly Thr His Val Ala Gly Ser Val Leu Gly Asn Ala
65                  70                  75                  80

Leu Asn Lys Gly Met Ala Pro Gln Ala Asn Leu Val Phe Gln Ser Ile
                85                  90                  95

Met Asp Ser Ser Gly Gly Leu Gly Gly Leu Pro Ser Asn Leu Asn Thr
            100                 105                 110

Leu Phe Ser Gln Ala Trp Asn Ala Gly Ala Arg Ile His Thr Asn Ser
        115                 120                 125

Trp Gly Ala Pro Val Asn Gly Ala Tyr Thr Ala Asn Ser Arg Gln Val
    130                 135                 140

Asp Glu Tyr Val Arg Asn Asn Asp Met Thr Val Leu Phe Ala Ala Gly
145                 150                 155                 160

Asn Glu Gly Pro Asn Ser Gly Thr Ile Ser Ala Pro Gly Thr Ala Lys
                165                 170                 175

Asn Ala Ile Thr Val Gly Ala Thr Glu Asn Tyr Arg Pro Ser Phe Gly
            180                 185                 190

Ser Ile Ala Asp Asn Pro Asn His Ile Ala Gln Phe Ser Ser Arg Gly
        195                 200                 205

Ala Thr Arg Asp Gly Arg Ile Lys Pro Asp Val Thr Ala Pro Gly Thr
    210                 215                 220

Phe Ile Leu Ser Ala Arg Ser Ser Leu Ala Pro Asp Ser Ser Phe Trp
225                 230                 235                 240

Ala Asn Tyr Asn Ser Lys Tyr Ala Tyr Met Gly Gly Thr Ser Met Ala
                245                 250                 255

Thr Pro Ile Val Ala Gly Asn Val Ala Gln Leu Arg Glu His Phe Ile
            260                 265                 270

Lys Asn Arg Gly Ile Thr Pro Lys Pro Ser Leu Ile Lys Ala Ala Leu
        275                 280                 285

Ile Ala Gly Ala Thr Asp Val Gly Leu Gly Tyr Pro Ser Gly Asp Gln
    290                 295                 300

Gly Trp Gly Arg Val Thr Leu Asp Lys Ser Leu Asn Val Ala Tyr Val
305                 310                 315                 320

Asn Glu Ala Thr Ala Leu Ala Thr Gly Gln Lys Ala Thr Tyr Ser Phe
                325                 330                 335

Gln Ala Gln Ala Gly Lys Pro Leu Lys Ile Ser Leu Val Trp Thr Asp
            340                 345                 350

Ala Pro Gly Ser Thr Thr Ala Ser Tyr Thr Leu Val Asn Asp Leu Asp
        355                 360                 365

Leu Val Ile Thr Ala Pro Asn Gly Gln Lys Tyr Val Gly Asn Asp Phe
    370                 375                 380

Ser Tyr Pro Tyr Asp Asn Asn Trp Asp Gly Arg Asn Asn Val Glu Asn
385                 390                 395                 400

Val Phe Ile Asn Ala Pro Gln Ser Gly Thr Tyr Ile Ile Glu Val Gln
                405                 410                 415

Ala Tyr Asn Val Pro Ser Gly Pro Gln Arg Phe Ser Leu Ala Ile Val
            420                 425                 430

His



<210> 3
<211> 433
<212> PRT
<213> Bacillus sp. SD-521

<220>
<221> PEPTIDE
<222> (1)..(433)
<223> Subtilase SD-521

<400> 3

Asn Asp Val Ala Arg Gly Ile Val Lys Ala Asp Val Ala Gln Asn Asn
1               5                   10                  15

Tyr Gly Leu Tyr Gly Gln Gly Gln Val Val Ala Val Ala Asp Thr Gly
            20                  25                  30

Leu Asp Thr Gly Arg Asn Asp Ser Ser Met His Glu Ala Phe Arg Gly
        35                  40                  45

Lys Ile Thr Ala Leu Tyr Ala Leu Gly Arg Thr Asn Asn Ala Asn Asp
    50                  55                  60

Pro Asn Gly His Gly Thr His Val Ala Gly Ser Val Leu Gly Asn Ala
65                  70                  75                  80

Leu Asn Lys Gly Met Ala Pro Gln Ala Asn Leu Val Phe Gln Ser Ile
                85                  90                  95

Met Asp Ser Ser Gly Gly Leu Gly Gly Leu Pro Ser Asn Leu Asn Thr
            100                 105                 110

Leu Phe Ser Gln Ala Trp Asn Ala Gly Ala Arg Ile His Thr Asn Ser
        115                 120                 125

Trp Gly Ala Pro Val Asn Gly Ala Tyr Thr Ala Asn Ser Arg Gln Val
    130                 135                 140

Asp Glu Tyr Val Arg Asn Asn Asp Met Thr Val Leu Phe Ala Ala Gly
145                 150                 155                 160

Asn Glu Gly Pro Asn Ser Gly Thr Ile Ser Ala Pro Gly Thr Ala Lys
                165                 170                 175

Asn Ala Ile Thr Val Gly Ala Thr Glu Asn Tyr Arg Pro Ser Phe Gly
            180                 185                 190

Ser Leu Ala Asp Asn Pro Asn His Ile Ala Gln Phe Ser Ser Arg Gly
        195                 200                 205

Ala Thr Arg Asp Gly Arg Ile Lys Pro Asp Val Thr Ala Pro Gly Thr
    210                 215                 220

Phe Ile Leu Ser Ala Arg Ser Ser Leu Ala Pro Asp Ser Ser Phe Trp
225                 230                 235                 240

Ala Asn Tyr Asn Ser Lys Tyr Ala Tyr Met Gly Gly Thr Ser Met Ala
                245                 250                 255

Thr Pro Ile Val Ala Gly Asn Val Ala Gln Leu Arg Glu His Phe Ile
            260                 265                 270

Lys Asn Arg Gly Ile Thr Pro Lys Pro Ser Leu Ile Lys Ala Ala Leu
        275                 280                 285

Ile Ala Gly Ala Thr Asp Val Gly Leu Gly Tyr Pro Ser Gly Asp Gln
    290                 295                 300

Gly Trp Gly Arg Val Thr Leu Asp Lys Ser Leu Asn Val Ala Tyr Val
305                 310                 315                 320

Asn Glu Ala Thr Ala Leu Ala Thr Gly Gln Lys Ala Thr Tyr Ser Phe
                325                 330                 335

Gln Ala Gln Ala Gly Lys Pro Leu Lys Ile Ser Leu Val Trp Thr Asp
            340                 345                 350

Ala Pro Gly Ser Thr Thr Ala Ser Tyr Thr Leu Val Asn Asp Leu Asp
        355                 360                 365

Leu Val Ile Thr Ala Pro Asn Gly Gln Lys Tyr Val Gly Asn Asp Phe
    370                 375                 380

Ser Tyr Pro Tyr Asp Asn Asn Trp Asp Gly Arg Asn Asn Val Glu Asn
385                 390                 395                 400

Val Phe Ile Asn Ala Pro Gln Ser Gly Thr Tyr Thr Ile Glu Val Gln
                405                 410                 415

Ala Tyr Asn Val Pro Ser Gly Pro Gln Arg Phe Ser Leu Ala Ile Val
            420                 425                 430

His



<210> 4
<211> 275
<212> PRT
<213> Bacillus amyloliquefaciens

<220>
<221> PEPTIDE
<222> (1)..(275)
<223> Subtilase BPN'

<400> 4

Ala Gln Ser Val Pro Tyr Gly Val Ser Gln Ile Lys Ala Pro Ala Leu
1               5                   10                  15

His Ser Gln Gly Tyr Thr Gly Ser Asn Val Lys Val Ala Val Ile Asp
            20                  25                  30

Ser Gly Ile Asp Ser Ser His Pro Asp Leu Lys Val Ala Gly Gly Ala
        35                  40                  45

Ser Met Val Pro Ser Glu Thr Asn Pro Phe Gln Asp Asn Asn Ser His
    50                  55                  60

Gly Thr His Val Ala Gly Thr Val Ala Ala Leu Asn Asn Ser Ile Gly
65                  70                  75                  80

Val Leu Gly Val Ala Pro Ser Ala Ser Leu Tyr Ala Val Lys Val Leu
                85                  90                  95

Gly Ala Asp Gly Ser Gly Gln Tyr Ser Trp Ile Ile Asn Gly Ile Glu
            100                 105                 110

Trp Ala Ile Ala Asn Asn Met Asp Val Ile Asn Met Ser Leu Gly Gly
        115                 120                 125

Pro Ser Gly Ser Ala Ala Leu Lys Ala Ala Val Asp Lys Ala Val Ala
    130                 135                 140

Ser Gly Val Val Val Val Ala Ala Ala Gly Asn Glu Gly Thr Ser Gly
145                 150                 155                 160

Ser Ser Ser Thr Val Gly Tyr Pro Gly Lys Tyr Pro Ser Val Ile Ala
                165                 170                 175

Val Gly Ala Val Asp Ser Ser Asn Gln Arg Ala Ser Phe Ser Ser Val
            180                 185                 190

Gly Pro Glu Leu Asp Val Met Ala Pro Gly Val Ser Ile Gln Ser Thr
        195                 200                 205

Leu Pro Gly Asn Lys Tyr Gly Ala Tyr Asn Gly Thr Ser Met Ala Ser
    210                 215                 220

Pro His Val Ala Gly Ala Ala Ala Leu Ile Leu Ser Lys His Pro Asn
225                 230                 235                 240

Trp Thr Asn Thr Gln Val Arg Ser Ser Leu Glu Asn Thr Thr Thr Lys
                245                 250                 255

Leu Gly Asp Ser Phe Tyr Tyr Gly Lys Gly Leu Ile Asn Val Gln Ala
            260                 265                 270

Ala Ala Gln
        275
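
The sequence listing uses three-letter residue codes with position markers on alternate lines, whereas Figure 1 and the claims use one-letter codes. A minimal Python sketch of the conversion is given below, assuming the standard 20 amino acid codes only; the function name is hypothetical, and the input fragment is the first line of SEQ ID NO:4.

```python
# Standard three-letter to one-letter amino acid codes (20 common residues).
THREE_TO_ONE = {
    "ALA": "A", "ARG": "R", "ASN": "N", "ASP": "D", "CYS": "C",
    "GLN": "Q", "GLU": "E", "GLY": "G", "HIS": "H", "ILE": "I",
    "LEU": "L", "LYS": "K", "MET": "M", "PHE": "F", "PRO": "P",
    "SER": "S", "THR": "T", "TRP": "W", "TYR": "Y", "VAL": "V",
}


def to_one_letter(listing_text):
    """Convert ST.25-style residue lines to a one-letter string.

    Purely numeric tokens are treated as position markers and skipped;
    residue codes are matched case-insensitively.
    """
    letters = []
    for token in listing_text.split():
        if token.isdigit():
            continue  # position marker such as 5, 10, 15
        code = token.upper()
        if code not in THREE_TO_ONE:
            raise ValueError(f"unrecognised residue code: {token!r}")
        letters.append(THREE_TO_ONE[code])
    return "".join(letters)


fragment = """
Ala Gln Ser Val Pro Tyr Gly Val Ser Gln Ile Lys Ala Pro Ala Leu
1               5                   10                  15
"""
print(to_one_letter(fragment))  # AQSVPYGVSQIKAPAL
```

Raising on an unrecognised code, rather than silently skipping it, is intended to surface transcription errors in a listing instead of producing a shortened sequence.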

APPENDIX 1 



REMARK Complex of JP170 and CI2A inhibitor
REMARK Contents of asymmetric unit: subtilisin 2x (433 a.a. x 2)
REMARK CI2A inhibitor 2x (a.a. 16-83 and 21-83)
REMARK small peptide (autodigestion product, a.a. KPSLI*, 280 - 284)
REMARK Ca ions 6x, H2O 1115 x
REMARK Crystallization conditions: {AMB} Hanging drop vapour diffusion
REMARK method where the drop consists of 2 µl of 15 - 20 mg.ml-1
REMARK protein concentration, 10 mM Na cacodylate - HCl buffer, pH 6.5
REMARK and 1 µl of the well solution, 20% w/v PEG 4000, 0.1 M Hepes
REMARK buffer, pH 7.5, 10% v/v isopropanol,
HEADER XX-XXX-XX xxxx
COMPND
REMARK 3
REMARK 3 REFINEMENT.
REMARK 3   PROGRAM : REFMAC 5.1.24
REMARK 3   AUTHORS : MURSHUDOV, VAGIN, DODSON
REMARK 3
REMARK 3 REFINEMENT TARGET : MAXIMUM LIKELIHOOD
REMARK 3
REMARK 3 DATA USED IN REFINEMENT.
REMARK 3   RESOLUTION RANGE HIGH (ANGSTROMS) : 1.90
REMARK 3   RESOLUTION RANGE LOW (ANGSTROMS)  : 19.96
REMARK 3   DATA CUTOFF (SIGMA(F))            : NONE
REMARK 3   COMPLETENESS FOR RANGE (%)        : 76.65
REMARK 3   NUMBER OF REFLECTIONS             : 59444
REMARK 3
REMARK 3 FIT TO DATA USED IN REFINEMENT.
REMARK 3   CROSS-VALIDATION METHOD           : NULL
REMARK 3   FREE R VALUE TEST SET SELECTION   : NULL
REMARK 3   R VALUE (WORKING + TEST SET)      : 0.12256
REMARK 3   R VALUE (WORKING SET)             : 0.12256
REMARK 3   FREE R VALUE                      : NULL
REMARK 3   FREE R VALUE TEST SET SIZE (%)    : NULL
REMARK 3   FREE R VALUE TEST SET COUNT       : NULL
REMARK 3
REMARK 3 FIT IN THE HIGHEST RESOLUTION BIN.
REMARK 3   TOTAL NUMBER OF BINS USED         : 20
REMARK 3   BIN RESOLUTION RANGE HIGH         : 1.901
REMARK 3   BIN RESOLUTION RANGE LOW          : 1.950
REMARK 3   REFLECTION IN BIN (WORKING SET)   : 940
REMARK 3   BIN R VALUE (WORKING SET)         : 0.149
REMARK 3   BIN FREE R VALUE SET COUNT        : 0
REMARK 3   BIN FREE R VALUE                  : -999.000
REMARK 3
REMARK 3 NUMBER OF NON-HYDROGEN ATOMS USED IN REFINEMENT.
REMARK 3   ALL ATOMS : 8694
REMARK 3
REMARK 3 B VALUES.
REMARK 3   FROM WILSON PLOT (A**2)           : NULL
REMARK 3   MEAN B VALUE (OVERALL, A**2)      : 16.479
REMARK 3   OVERALL ANISOTROPIC B VALUE.
REMARK 3     B11 (A**2) : 0.05
REMARK 3     B22 (A**2) : 0.06
REMARK 3     B33 (A**2) : -0.11
REMARK 3     B12 (A**2) : 0.00
REMARK 3     B13 (A**2) : 0.00
REMARK 3     B23 (A**2) : 0.00
REMARK 3
REMARK 3 ESTIMATED OVERALL COORDINATE ERROR.
REMARK 3   ESU BASED ON R VALUE (A)                            : 0.151
REMARK 3   ESU BASED ON FREE R VALUE (A)                       : NULL
REMARK 3   ESU BASED ON MAXIMUM LIKELIHOOD (A)                 : 0.052
REMARK 3   ESU FOR B VALUES BASED ON MAXIMUM LIKELIHOOD (A**2) : 1.828
REMARK 3
REMARK 3 CORRELATION COEFFICIENTS.
REMARK 3   CORRELATION COEFFICIENT FO-FC      : 0.969
REMARK 3   CORRELATION COEFFICIENT FO-FC FREE : NULL
REMARK 3
REMARK 3 RMS DEVIATIONS FROM IDEAL VALUES         COUNT ; RMS    ; WEIGHT
REMARK 3   BOND LENGTHS REFINED ATOMS (A)       :  7733 ; 0.014  ; 0.021
REMARK 3   BOND LENGTHS OTHERS (A)              :  6857 ; 0.001  ; 0.020
REMARK 3   BOND ANGLES REFINED ATOMS (DEGREES)  : 10540 ; 1.478  ; 1.936
REMARK 3   BOND ANGLES OTHERS (DEGREES)         : 15972 ; 0.815  ; 3.000
REMARK 3   TORSION ANGLES, PERIOD 1 (DEGREES)   :   997 ; 15.784 ; 5.000
REMARK 3   CHIRAL-CENTER RESTRAINTS (A**3)      :  1197 ; 0.106  ; 0.200
REMARK 3   GENERAL PLANES REFINED ATOMS (A)     :  8819 ; 0.007  ; 0.020
REMARK 3   GENERAL PLANES OTHERS (A)            :  1500 ; 0.008  ; 0.020
REMARK 3   NON-BONDED CONTACTS REFINED ATOMS (A):  1552 ; 0.221  ; 0.300
REMARK 3   NON-BONDED CONTACTS OTHERS (A)       :  8282 ; 0.265  ; 0.300
REMARK 3   NON-BONDED TORSION OTHERS (A)        :  4417 ; 0.089  ; 0.500
REMARK 3   H-BOND (X...Y) REFINED ATOMS (A)     :  1391 ; 0.198  ; 0.500
REMARK 3   POTENTIAL METAL-ION REFINED ATOMS (A):    25 ; 0.145  ; 0.500
REMARK 3   SYMMETRY VDW REFINED ATOMS (A)       :    10 ; 0.129  ; 0.300
REMARK 3   SYMMETRY VDW OTHERS (A)              :    57 ; 0.268  ; 0.300
REMARK 3   SYMMETRY H-BOND REFINED ATOMS (A)    :    87 ; 0.272  ; 0.500
REMARK 3
REMARK 3 ISOTROPIC THERMAL FACTOR RESTRAINTS.     COUNT ; RMS    ; WEIGHT
REMARK 3   MAIN-CHAIN BOND REFINED ATOMS (A**2) :  4985 ; 0.697  ; 1.500
REMARK 3   MAIN-CHAIN ANGLE REFINED ATOMS (A**2):  8031 ; 1.205  ; 2.000
REMARK 3   SIDE-CHAIN BOND REFINED ATOMS (A**2) :  2746 ; 1.963  ; 3.000
REMARK 3   SIDE-CHAIN ANGLE REFINED ATOMS (A**2):  2509 ; 3.180  ;
REMARK 3
REMARK 3 NCS RESTRAINTS STATISTICS
REMARK 3   NUMBER OF NCS GROUPS : NULL
REMARK 3
REMARK 3 TLS DETAILS
REMARK 3   NUMBER OF TLS GROUPS : NULL
REMARK 3
REMARK 3 BULK SOLVENT MODELLING.
REMARK 3   METHOD USED : BABINET MODEL WITH MASK
REMARK 3   PARAMETERS FOR MASK CALCULATION
REMARK 3   VDW PROBE RADIUS : 1.40
REMARK 3   ION PROBE RADIUS : 0.80
REMARK 3   SHRINKAGE RADIUS : 0.80
REMARK 3
REMARK 3 OTHER REFINEMENT REMARKS:
REMARK 3 HYDROGENS HAVE BEEN ADDED IN THE RIDING POSITIONS
REMARK 3
CISPEP 1 GLY A 163 PRO A 164 0 0.00
CISPEP 2 ALA A 171 PRO A 172 0 0.00
CISPEP 3 PHE A 191 GLY A 192 0 0.00
ATOM 3796 C   VAL B 82 -0.583 33.532 -1.371 1.00 20.69
ATOM 3797 O   VAL B 82 -0.235 33.255 -0.221 1.00 19.36
ATOM 3798 N   GLY B 83 -0.469 32.694 -2.389 1.00 19.91
ATOM 3799 CA  GLY B 83  0.382 31.525 -2.331 1.00 20.16
ATOM 3800 C   GLY B 83 -0.236 30.295 -2.955 1.00 20.11
ATOM 3801 O   GLY B 83 -1.416 30.328 -3.319 1.00 20.36
ATOM 3802 OXT GLY B 83  0.468 29.294 -3.104 1.00 18.79
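
Position lists such as those defined by a 10 Å distance from an ion-binding site or from the bound inhibitor can in principle be derived from coordinate records like the ones above. The following Python sketch shows one way to do so under stated assumptions: the records are split on whitespace (as they appear in this appendix, not in strict fixed-column PDB layout), the function names are hypothetical, and the centre point used in the example is simply the CA atom of GLY B 83, since no ion coordinates appear in this excerpt.

```python
import math


def parse_atom_line(line):
    """Parse one whitespace-separated ATOM record as printed in this appendix.

    Assumed field order: ATOM, serial, atom name, residue name, chain,
    residue number, x, y, z, occupancy, B factor.
    """
    fields = line.split()
    if len(fields) != 11 or fields[0] != "ATOM":
        raise ValueError(f"unexpected ATOM record: {line!r}")
    return {
        "serial": int(fields[1]),
        "name": fields[2],
        "res_name": fields[3],
        "chain": fields[4],
        "res_seq": int(fields[5]),
        "xyz": (float(fields[6]), float(fields[7]), float(fields[8])),
    }


def residues_within(atoms, centre, cutoff):
    """Return {(chain, residue number)} having any atom within `cutoff` of `centre`."""
    return {
        (atom["chain"], atom["res_seq"])
        for atom in atoms
        if math.dist(atom["xyz"], centre) <= cutoff
    }


records = """\
ATOM 3796 C   VAL B 82 -0.583 33.532 -1.371 1.00 20.69
ATOM 3797 O   VAL B 82 -0.235 33.255 -0.221 1.00 19.36
ATOM 3798 N   GLY B 83 -0.469 32.694 -2.389 1.00 19.91
ATOM 3799 CA  GLY B 83  0.382 31.525 -2.331 1.00 20.16
ATOM 3800 C   GLY B 83 -0.236 30.295 -2.955 1.00 20.11
ATOM 3801 O   GLY B 83 -1.416 30.328 -3.319 1.00 20.36
ATOM 3802 OXT GLY B 83  0.468 29.294 -3.104 1.00 18.79
"""
atoms = [parse_atom_line(line) for line in records.splitlines()]

# Hypothetical centre: the CA atom of GLY B 83 stands in for an ion position.
centre = next(a["xyz"] for a in atoms if a["name"] == "CA" and a["res_seq"] == 83)
print(sorted(residues_within(atoms, centre, 10.0)))
```

For a real coordinate file the fixed-column layout of the PDB format should be honoured, because adjacent fields can run together without whitespace; the split-based parser here is adequate only for records laid out like those printed above.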
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